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INCREASING THE TREHALOSE CONTENT OP ORGANISMS BY TRANSFORMING 
THEM WITH COMBINATIONS OF THE BTRPCTPRAI. GENES TOR TREHALOSE 
SYNTHASE. 

5 FIELD OF INVENTION 

f 

~" " This invention -relates- to the jgenetic engineering of the 

trehalose synthetic pathway of yeasts, such as baker's and ~ "'" 
distiller's yeasts, and to the transfer of this pathway by 

10 genetic engineering to other organisms. In particular, the 
present invention concerns trehalose synthase, novel genes 
encoding the trehalose synthase, novel vectors containing the 
novel genes, and host cells and organisms transformed with the 
novel vectors. The invention also relates to the production of 

15 trehalose and ethanol and to the improvement of the stress 

resistance of organisms, in particular yeasts and crop plants. 

BACKGROUND OF THE INVENTION; 

20 it is well known that sugars and other polyhydric compounds 

stabilize isolated proteins and phospholipid membranes during 
dehydration, probably by replacing the water molecules that are 
hydrogen-bonded to these macromolecules [reviewed by Crowe, 
J.H. et ah (1987) Biochemical Journal 242, 1-10]. Trehalose 
(a-glucopyranosyl-a-D-glucopyranose) is a dimer of two glucose 
molecules linked through their reducing groups. Because it has 
no reducing groups, it does not take part in the Maillard 
reactions that cause many sugars to damage proteins, and it is 
one of the most effective known protectants of proteins and 
> biological membranes in vitro . 

In nature, trehalose is found at high concentrations in yeasts 
and other fungi, some bacteria, insects, and some litoral 
animals, such as the brine shrimp. It is notable that all these 
organisms are frequently exposed to osmotic and dehydration 
stress. Accumulation of trehalose in higher plants is rare, but 
high levels occur in the so-called resurrection plants, such as 
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the pteridophyte, Selaoinella lepidophvlla . which can survive 
extended drought [Quillet, M. and Soulet, M. (1964) Comptes 
Rendus de l'Academie des Sciences, Paris 259, pp. 635-637; 
reviewed by Avigad, G. (1982) in Encyclopedia of Plant Research 
5 (New Series) 13A, pp. 217-347] . 

A decreased availability of intracellular water to proteins and 
membranes is a common feature not only of dehydration and 
osmotic stress, but also of freezing,, in which- ice formation 
withdraws water from inside the cells, and heat stress, which 
weakens the hydrogen bonds between water and biological 
macromolecules. In recent years several publications have shown 
a close connection between the trehalose content of yeast 
cells, especially of the species Saccharomvces cerevisiae . and 
their resistance to dehydration and osmotic, freezing and heat 
stresses. This work has lead to the concept [summarized by 
Wiemkem, A. (1990) Antonie van Leeuwenhoek 58, 209-217] that, 
whereas the main storage or reserve carbohydrate in yeast is 
glycogen, ' the prime physiological function of trehalose is as a 
protectant against these and other stresses, including 
starvation and even poisoning by copper, ethanol and hydrogen 
peroxide, which all stimulate trehalose accumulation [Attfield, 
P.V. (1987) Federation of European Biochemical Societies 
Letters 225, 259-263]. 

Thus, during growth of Saccharomvces cerevisiae on glucose, 
glycogen begins to accumulate about one generation before the 
glucose is exhausted, and begins to be steadily consumed as 
soon as all external carbon supplies are exhausted. In 
contrast, accumulation of trehalose (partly at the expense of 
glycogen) only begins after all the glucose has been consumed/ 
and the trehalose level is then maintained until nearly all the 
glycogen has been consumed [Lillie, S.A. & Pringle, J.R. (1980) '< 
Journal of Bacteriology 143, 1384-1394]. The eventual 
consumption of trehalose is accompanied by a rapid decrease in 
the number of viable cells. 

When trehalose levels in S. cerevisiae have been manipulated by 
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varying the growth conditions or administering heat shocks, 
high positive correlations have been found between the 
trehalose content of the cells and their resistance to 
dehydration [Gadd, 6. et al. (1987) Federation of European 
Microbiological Societies Microbiological Letters 48, 249-254], 
heat stress [Hottiger, T. et al., (1987) Federation of European 
Biochemical Societies Letters 220, 113-115] and freezing 
[Gelinas, P. et al. Applied and Environmental Microbiology 55, 
2453-2459]. Also, strains of S. cerevisiap and other yeasts 
selected for resistance to osmotic stress [D'Amore, T. et al. 
(1991) Journal of Industrial Microbiology 7, 191-196] or high 
performance in frozen dough fermentation [Oda, Y. (1986) 
Applied and Environmental Microbiology 52, 941-943] were found 
to have unusually high trehalose contents. Furthermore, a 
mutation in the cyclic AMP signaling system of S. cerevislap 
that causes constitutive high trehalose levels also causes 
constitutive thermotolerance, .whereas another mutation in this 
system that prevents the usual rise in trehalose during heat 
shock also prevents the acquisition of thermotolerance 
20 [Hottiger, T. et aJU^ (1989) Federation of European Biochemical 
Societies Letters 255, 431-434]. Thus, there is much evidence 
pointing to a connection between trehalose content and . stress 
resistance in yeasts, especially S. carevisiac . similar 
findings have been made for several other fungi [e.g., Neves, 
5 M.J., Jorge, J. A., Francois, J. M. & Terenzi, H.F. (1991) 
Federation of European Biochemical Societies Letters 283, 
19-22]. However, a causative relationship has not yet been 
demonstrated. Further, nearly all conditions that cause 
increases in the trehalose content of yeast also cause 
increases in the levels of the so-called heat shock proteins. 
The 1989 publication of Hottiger and colleagues, 
cited above, claims that canavanine does not cause ah increase 
in either trehalose levels or thermotolerance, whereas this 
compound is reported to induce heat shock proteins. 



Whether or not there is a causal relation b tween trehalose 
content and stress resistance, it has become general practice 
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in the manufacture of baker's yeast to maximise the trehalose 
content of the yeas£. Various maturation processes have been 
developed to achieve this aim, and they are of crucial 
importance in the manufacture of active dried yeast. The 
5 details of these processes are often secret, but they are 

generally empirical regimes in which carbon and nitrogen feeds , 
aeration and temperature are carefully controlled and selected 
strains, of yeast- are used.- They demand time~ and "energy inputs 
during which little increase in cell number occurs. A more 
10 rational and controlled process would be of economic benefit. 

According to Cabib, E. & Leloir, L.F. [(1957) Journal of 
Biological Chemistry 231, 259-275] f trehalose is synthesized in 
yeast from uridine diphosphoglucose (UDPG) and glucose- 

15 -6-phosphate (G6P) by the sequential action of two enzyme 
activities , trehalose-6-phosphate synthase and trehalose- 
-6-phosphate phosphatase. Londesborough, J.& Vuorio, 0. [(1991) 
Journal of Microbiology 137, 323-330, expressly incorporated 
herein by reference] have purified from baker's yeast a 

20 proteolytically modified protein complex that exhibited both 
these activities and appeared to contain a short polypeptide 
chain (57 kDa) and two truncated versions (86 kDa and 93 kDa) 
of a long polypeptide chain. The intact long chain was 
estimated to have a mass of at least 115 kDa. It was not 

25 disclosed which enzyme activity or activities was associated 
with which polypeptide, nor indeed whether both polypeptide 
were essential for either or both enzymatic activities. 
Anti-sera against both polypeptides were reported, but no amino 
acid sequences were disclosed. 

30 

An earlier patent application [EP 451 896; see Claim 1] has 
claimed for a transformed yeast "comprising. . . . one gene 

encoding trehalose-6-phosphate synthase". However, no 

information about the either the gene' or the protein it encodes 
35 was provided. 



Several authors have reported increases in TPS activity in 
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conditions that lead to accumulation of trehalose by S. 
cerevisiae. and Schizosaccharomvces pombe both during the 
approach to stationary phase [Winkler, K. , et al. (1991) 
Federation of European Biochemical Societies Letters 291, 269- 
5 272; Francois, J., et al. (1991) Yeast 7, 575-587] and after 
temperature shift-ups to about 40 °C [De Virgilio, C, et al, 
(1990) Federation of European Biochemical, Societies Letters 
273, 107-110] . Panek and her colleagues [Panek, A.C. . et al. " 
(1987) Current Genetics 11, 459-465] have claimed that TPS 
10 activity is increased by dephosphorylation of pre-existing 
enzyme molecules, i.e., that it is the result of 
post-trans lational regulation. This claim has been challenged 
[Vandercammen, A., et al. r (1989) European Journal of 
Biochemistry 182, 613-620] but continues to be made [Panek, 
15 A.D. & Panek, A.C. (1990) Journal of Biotechnology 14, 

229-238]. Evidence for or against an increase in the amount of 
enzyme during trehalose accumulation is conflicting. Inhibitors 
of mRNA synthesis inhibited trehalose accumulation by S. 
cerevisiae shifted from 30 to 45 °c [Attfield (1937) loc.cit. ], 
20 whereas under very similar conditions Winkler et al [(1991) 
loc.cit.] found that cycloheximide (an inhibitor of protein 
synthesis) did not prevent the accumulation of trehalose, 
which, however, occurred without an observable increase in TPS 
activity. In a lower temperature range (a shift from 23 to 36 
25 °C) , trehalose accumulation was accompanied by a three-fold 
increase in TPS activity, and cycloheximide prevented the 
increase in TPS [Panek, A.C, et al. (1990) Biochemie 72, 
77-79]. In Schizos accharomvces pombe . [De Virgilio, C. , et al. 
(1991) loc. cit.] temperature shiftup caused a large 
30 accumulation of trehalose and increase of TPS which were not 

prevented by cycloheximide, leading the authors to suggest that 
in this yeast a post-translational activation occurs. We now 
disclose that in S. cerevisiae the co-ordinate increases in TPS 
and TPP activities during exhaustion of glucose are accompanied 
5 by an increase in antigenic material recognized by anti-sera to 
the short and long chains of a purified trehalose synthase. 
Hence, a method to increase the trehalose content of cells, and 



WO 93/17093 



PCT/FI93/00049 



so, their stress tolerance, would be to isolate, clone, and 
modify the' structural genes (hereinafter referred to as TSS1, 
TSL1, and TSL2) of these polypeptides and cause their 
expression in yeast or other host cells under the control of 
suitable promoters. If the expression of these genes could be 
controlled, then so could the trehalose content of the host 
cells. 



The well known metabolic theory of Kacser & Burns [(1973) 

10 Symposium of the Society of Experimental Biology 27 , 65-107] 
teaches that in principle the concentration of any 
intermediate, such as trehalose, can be increased by increasing 
the amount of any enzyme synthesizing it or decreasing the 
amount of any enzyme degrading it, but that the size of the 

15 increase may not be significant. The novelty of the present 
invention- lies in the identification and characterization of 
the particular yeast genes that must be modified to increase 
the amounts of trehalose synthase and the recognition of the 
advantages of modifying the synthetic pathway rather than the 

20 degradative pathway. These advantages include (i) leaving the 
highly regulated [see, e.g., Thevelein, J.M. (1988) 
Experimental Mycology 12, 1-12] degradative pathway intact to 
avoid the physiological problems likely in yeast that cannot 
activate this pathway, (ii) the possibility of causing yeast to 

25 synthesize trehalose under physiological conditions where wild 
type yeasts do not (so that blocking the degradative pathway 
cannot increase the amount of trehalose) and (iii) the 
important possibility of introducing by these genes a 
trehalose-synthetic capacity to organisms, such as most higher 

30 plants, that naturally lack this capacity. 

Expression of the genes for trehalose synthesis in yeast under 
conditions where trehalase is active will increase the 
operation of a so-called "futile" cycle, in which glucose is 
35 continuously phosphorylated , converted to trehalose and 
regenerated by hydrolysis of the trehalose, resulting in 
increased consumption of ATP. This ATP must be regenerated, and 
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under fermentative conditions this will occur by conversion of 
sugars into ethanol. Therefore, introduction of TSS1, TSL1 and 
TSL2 into yeast under the control of promoters active under 
fermentative conditions is expected to decrease the yield of 
5 cell mass on carbon source and increase that of ethanol. The 
many attempts [e.g., Schaaf et al. (1989) Yeast 5, 285-290] to 
increase fermentation rates in yeast by increasing the levels 
of glycolytic- enzymes have been-unsuecessful.- The probable " 
reason is that availability of ADP limits the rate of 

10 glycolysis in yeast. Introduction of a futile cycle-ATPase is 
thus expected to increase this rate. The feasibility of this 
invention is demonstrated by the finding [Gancedo, J.-M. & 
Navas, M.A. (1992) Yeast 8 S574] that expression of the 
gluconeogenic enzymes, fructose bisphosphatase and phosphoenol- 

15 pyruvate carboxykinase under fermentative conditions (so 
causing two futile cycles) caused a 50 % increase in the 
fermentation rate of resting yeast. Use of the trehalose futile 
cycle has the added advantage that the cells must then contain 
a steady state level of trehalose, which increases their 

20 tolerance to osmotic and temperature stress. 

The present invention includes transformed strains of 
distiller's yeast, in which the presence of modified forms. of 
any or all of TSSl, TSL1 and TSL2 results in an increased yield 
25 of ethanol from carbohydrate sources. 

As well as being used to improve the properties of yeast, 
especially active dried yeast and yeast for frozen doughs, this 
invention has other obvious applications. First, by increasing 

30 the proportion of trehalose in yeast, the industrial scale 

production of trehalose from yeast is made more economic. It is 
particularly advantageous to obtain trehalose from yeast 
because, since yeast is a traditional and safe food stuff/ a 
minimal purification of the trehalose will often be adequate: 

35 preparations of trehalose containing yeast residues could be 
safely added to food stuffs for human or animal consumption. 
Trehalose also has medical applications, both as a stabilizer 
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of diagnostic kits, viruses and other protein material [WO 
( 87/00196] and, potentially, as a source of anti-tumour agents 
[Ohtsuro et al. (1991) Immunology 74, 497-503]. Trehalose for 
internal applications in humans would be much more safely 
5 obtained from yeast than from a genetically engineered 
bacterium. 

.Second, by transferring these genes to higher plants after 
making suitable modifications obvious to anyone skilled in the 
art (in general , replacements of adenine/ thymine base pairs by 
guanine/cytosine base pairs as suggested by Perlak et al. 
[(1991) Proceedings of the National Academy of Sciences of the 
U.S.A. 88, 3324-3328] and the introduction of suitable 
promoters, some of which may be tissue-specif ic, to direct the 
synthesis of trehalose to frost and drought-sensitive tissues), 
the resistance of the plants to various stresses , especially 
frost and dehydration, should be improved. The economic 
importance of such improvements is potentially enormous, 
because even small increases in cold-tolerance will lead to 
large increases in growing season, whereas dehydration 
resistance can save entire crops in time of drought. Frost and 
drought resistance in higher plants is usually accompanied by 
increases in compounds such as proline rather than trehalose 
[reviewed by Stewart (1989) in "Plants under Stress", pp 
115-130], but, as mentioned above, resurrection plants 
accumulate large amounts of trehalose and there seems, a 
priori, to be no reason why this strategy should not be 
successful. Therefore, the present invention includes a process 
to transform crop plants by introducing recombinant forms of 
the structural genes of yeast trehalose synthase (TSS1, TSL1 
and TSL2) so as to increase the trehalose content of some of 
their tissues compared to those of the parent plant. Such 
transformed plants can also be economic and safe sources of 
trehalose. Third, the shelf-life of food products can be 
increased by adding trehalose to them [WO 89/00012]. A further 
aspect of the present invention is a novel process for 
producing trehalose-enriched food products from plants by 
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causing them to express the structural genes for yeast 
trehalose synthase in their edible tissues. 

SUMMARY OF INVENTION; 

The present invention provides two isolated genes encoding, 
respectively a short and a long chain of yeast trehalose 
synthase and a_third geneencoding-a -99 kDapolypeptide~that ~ 
occurs in some trehalose synthase preparations and has 
trehalose-6-phosphatase activity. These genes can be used to 
transform an organism (such as a yeast, other fungus or higher 
eukaryote), whereby the transformed organism produces more 
trehalose synthase resulting in a trehalose content higher than 
that of the parent organism. The higher trehalose content 
15 confers improved stress resistance and storage properties on 
the transformed organism as compared to the parent organism, 
and the transformed organism can be used to provide large 
quantities of trehalose. Thus, a process for producing a crop 
plant which has increased resistance to water deprivation, heat 
and cold, comprises transforming the plant by introducing at 
least one of the novel genes into the plant's tissue. 

The invention also provides a trehalose synthase which exhibits 
trehalose-6-phosphate synthase activity activatable by 
fructose-6-phosphate and also trehalose-6 -phosphatase activity. 

Finally, the invention provides processes for producing 
trehalose by cultivating a host or an organism which has been 
transformed with at least one of the novel genes, and processes 
for producing trehalose enriched food products from plants by 
introducing at least one of the novel genes and allowing said 
genes to express the trehalose synthase in the edible tissues 
of the plant. 
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BRIEF DESCRIPTION OF FIGURES 

Fig. 1. SDS-PAGE of intact trehalose synthase 

5 A 6-13 %T gradient gel was used. Lane 1 contains 8.3 ng of 

intact trehalose synthase eluted from the UDPG-Glucuronate- 5 
Agarose column with 0.2 K NaCl (#n of Table 1). Lanes 2, 3 and 
4 contain, -respectively, 7.7, 12 and 1.-0 fig of enzyme eluted 
from the column with 0.4 M NaCl containing 10 mM DDPG (#13, #14 
and #15 from Table 1) . Lane 5 contains about l jug each of 
molecular mass markers (myosin, B-galactosidase, 
a-phosphorylase, BSA, ovalbumin, lactate dehydrogenase, 
triosephosphate-isomerase, myoglobin and cytochrome c) . The 
major polypeptides of intact trehalose synthase are named on 
the left and the molecular mass calibration , in kDa, is shown 
on the right. 

F ig- 2. SDS-PAGE of im munoprecipitates of wild-tvpa yeast grown 
on YP/2 % glucose 

A 9 %T gel was used. Lane 1 contains about 1 ng each of the 
molecular mass markers used in Fig 1. Lanes 2, 3 and 4 contain 
immunoprecipitates from 3.8 mg fresh yeast harvested after 16.1 
h (1,2 % residual glucose), 18.1 h (no residual glucose) and 39 
h. The molecular mass calibration is shown on the left and the 
major polypeptides of trehalose synthase and the heavy chain of 
y-globulin are shown on the right. 

Fi 9- 3 - The promoter and t erminator of TSS1 and the amino acid 
30 sequence deduced from its ORF. ft 

(a) In the promoter and terminator regions, the start ATG and ^ 
tandem TGA stop codons are double underlined and a TATA box and 
putative catabolite repression element are underlined, (b) In 
35 the amino acid sequence (SEQ ID NO: 2) , the sequences found in 
peptides isolated from the short chain of trehalose synthase 
are underlined. 
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Fig* 4. The pro moter and terminator regions of TSL1 and the 
amino acid sequence deduced from its ORF. 
(a) In the promoter region, the start ATG codon is double 
underlined and two TATA boxes and six putative heat shock 
5 elements are underlined. A putative MIG1 binding site is 
overlined. In the terminator region, the TAA stop codon is 
double underlined and a putative transcription termination 
element is underlined . Love r case letters show the end of the-- 
terminator region of the ARGRII gene, which has opposite 
10 polarity, (b) In the amino acid sequence (SEQ ID NO: 82), 

sequences found in peptides isolated from (fragments of) the 
123 kDa long chain are underlined, and those from peptides 
liberated from intact trehalose synthase by limited digestion 
with trypsin are underlined and bold, 

15 

Fig. 5. Alignment of the amino acid sequences of the short and 
long chains of tr ehalose synthase 

The complete short chain sequence (SEQ ID NO: 2; the upper 
sequence) is aligned against residues 320 to 814 of the 123 kDa 
long chain (SEQ ID NO: 4; the lower sequence). 32 gaps are 
introduced to optimize the alignment. Vertical dashes indicate 
identical residues. Colons indicate conservative substitutions. 

Fig. 6. Important restr iction sites in TSS1 and TSL1 

The heavy lines indicate open reading frames. The scale bar 
shows one kb. 

Fi 9- 7 - Synthesis of r 14 Cl - trehalose from ru- 14 Cl -glucose 
6-phosphate bv an extract of wild-type veast 

Reaction mixtures (100 /xl) contained 40 mM HEPES/KOH pH 6.8, 1 
mg BSA/ml, 10 mM MgCl 2 10 mM [U- 14 C]-G6P (736 c.p.m./nmol) and 
(a) no phosphate or (b) 5 mM K phosphate pH 6,8 and (0) 5 mM 
UDPG, (•) 2.5 mM ADPG or (□) neither UDPG nor ADPG . Reactions 
were started by adding 10 Ml (equivalent to 94 /zg fresh yeast) 
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of a 28,000 g supernatant of stationary phase X2180. Reactions 
were stopped by transfer to boiling water for 2 mjln and 
addition of 1.0 ml of a slurry of AG1-X8 (formate) anion 
exchange resin [Londesborough & Vuorio (1991) loc. cit. ] . The 
radioactivity in the resin supernatant was measured. 

Fig. 8. Western analysis of Kla 102 and X2180 yeasts 



Growth of the yeasts is described in Example 7. The loads of 
fresh yeast per lane were: lane 1, 200 fig X2180/2; lanes 2 and 
5, 330 fig 2669/1: lanes 3 and 6, 610 fig 2669/2; lanes 4 and 7 , 
810 iig 2670/1+2; lane 8 f 560 X2180/1 and lane 9, 280 fig 
X2180/1. The blot was probed with anti-TPS/P serum at a 
dilution of 1/30,000* Major bands of trehalose synthase are 
identified on the right. 

Fig. 9- Treatmen t of truncated trehalose synthase with L9 mM 
SM 

Truncated enzyme (0.13 TPS units/ml a 43 fig/ml) in 2 mg BSA/ml 
50 mM HEPES pH 7.0 containing 67 mM NaCl, 0.2 mM EDTA, 0.17 mM 
dithiothreitol, 0.17 mM benzamidine and 1.7 mM UDPG was 
incubated at 24 °C with (closed symbols) or without (open 
symbols) 1.9 mM NEM. TPS (i.o] and TPP (■,□) activities were 
measured. . 

Pig. 10. Autoradioaram of truncated trehalose synthase labelled 
with f 14 C1-NEM and separated by SDS-PAGE 

Labelling was performed as described in Example 8 for 1.5, 
10. 5 f 63 and 190 min in lanes 1, 2, 3 and 4, respectively. The 
positions of the (57 kDa) short chain, 93 and 86 kDa long chain 
fragments and the carrier BSA are indicated. 
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Fig. 11. Treatment of truncated trehalose synthase with ethvl- 
labelled NEM . 

Truncated enzyme (7.2 TPS units/ml « o.24 mg/ml) in 1 mg BSA/ml 
5 25 mM HEPES pH 7.0 containing 2 mM MgC12, 1 mM EDTA and 0.2 M 
NaCl was incubated at 23 °C with (solid symbols) or without 
(open symbols) 32 /zM ethyl -label led NEM. TPS (i,o] and TPP 
(■#_□) .activities. and_ the amounts_of_ [J^C] -NEM -incorporated— into 
the 93 (a) , 86 (+) and 57 (X) kDa polypeptides were measured. 
10 0*1 mol NEM incorporated per mol (150 Kg) of enzyme corresponds 
to an excess radioactivity of 75 c.p.m. in bands cut from the 
gel. 

Fig- 12. Stoichiometrv of NEM labelling 

15 

Residual TPP activity is plotted against the amount of NEM 
incorporated to the 93 and 86 kDa fragments of the long chain. 
Ring-labelled (•) and ethyl-labelled (0) NEM were used. 

20 Fig. 13. SDS-PAGE analysis of fractions eluted from the 
cellulose-phosphate with b uffer containing 0.3 % Triton 

Lane L contains 47 (il of the intact trehalose synthase applied 
to the column. Lane M contains about l /xg each of the molecular 

25 mass markers used in Fig 1. The numbered lanes contain 33 fil of 
selected 1.5 ml fractions eluted from the column . The NaCl 
gradient began to appear in fraction 6 and reached 300 mM at 
fraction 27. A step to 600 mM NaCl emerged between fractions 36 
and 37. Fractions 40 to 42 were eluted with 200 mM K phosphate. 

10 The major bands in the trehalose synthase preparation are 
identified on the left. Details are given in Example 9. 

Fi 9* 14. In vitro activat ion of trehalose synthase bv limited 
tryptic digestion 

15 

Intact trehalose synthase was incubated with (solid symbols) 
and without (open symbols) trypsin and its TPS activity 
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measured in the presence of 5 mM F6P in reaction mixtures, 
containing (0, ) no phosphate or (■) 5 mM K phosphate pH 6,8. 
Details are given in Example 10. 

Fig. 15. Limited trvptic digestion of intact trehalose synthase 

Lane 1 contains the untreated trehalose synthase used in Fig. 
. 15 and lane 2 the same amount. of_ enzyme after -48- min- treatment — 
with trypsin. Lane 3 contains molecular mass standards. The 
major polypeptides of trehalose synthase are identified on the 
left. 

Fig. 16. The effect of fructose 6-phosohate on the TPS activity 
of intact trehalose synthase, at different phosphate 
concentrations 

The TPS activity of native trehalose synthase was measured 
between zero and 10- mM F6P. other conditions were as in the 
standard TPS assay with (•) no changes, (0) 1.3 mM K phosphate 
pH 6.8 added or (■) 4 mM K phosphate pH 6.8 and 0.1 H KC1 added 
and the MgCl 2 concentration decreased to 2.5 mM. Activities are 
shown as percentages of that in the standard assay (i.e., at 5 
mM F6P and no phosphate) . 

Fig. 17. Activation of the TPP activities of intact and 
truncated trehalose synthase bv phosphate 

TPP activities were measured at 0.5 mM [ 14 C]-trehalose-6- 
phosphate in assay mixtures containing 50 mM Hepes pH 6.8, 1 mg 
bovine albumin/ ml and the indicated concentrations of K 
phosphate pH 6.8 and are shown as percentages of the standard 
TPS activity. Initial rates are shown for the (I) intact and 
(•) truncated enzyme. Rates during the second five minutes of 
the accelerating reaction obtained with truncated enzyme are 
also shown (O) . 
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Fig. 18. Phosphate-dependence of the TPP activity of intact 
trehalose synthase 

The reciprocal of the increase in rate (VJ caused by the 
5 phosphate is plotted against (0) [phosphate]" 2 or (#) 

[phosphate]' 1 . V A is shown as a percentage of the standard TPS 
activity. 

Fig. 19. Wes tern analyses of E. coli transformed with TSS1 and 
10 TSL1 

The gels were loaded with samples of whole homogenates (HOM) 
equivalent to 300 ± 12 fig fresh cells or 28 O00 g supernatants 
(SUP) equivalent to 340 ± 25 fig fresh cells. The letters above 

15 the lanes indicate the cell types: K, control (HB101) cells; L, 
ALK03568 (HB101 transformed with TSL1) ; S, ALK03566 "(HB101 
transformed with TSS1) . Gel 1 was probed with anti-57K serum 
(1/20 000) and gel 2 with anti-93K serum (1/20 000). The 
positions of the 57 kDa short chain and about 60, 36 and 3 5 kDa 

20 fragments of the 123 kDa long chain are shown. Molecular mass 
standards are labelled in JcDas. 

Fig. 20. Plasmids containing TSS1 and TSLl 

25 pBluescript containing (a) TSS1 with its own promoter, (b) TSS1 
without its promoter and (c) TSL1 with its own promoter are 
shown. 

Fig. 21. Southern analysis of two tssl disruptants of 
30 S.cerevisiae. 

Clal digests of DNA from control yeast (S150-2B; lanes 2,5 and 
9), and two tssl disruptants, ALKO 3569 (lanes 3, 6 and 10) and 
ALKO 3570 (lanes 4,7 and 11) were probed with TSS1 (lanes 2 to 
35 4), LED2 (lanes 5 to 7) and TSL1 (lanes 9 to 11) . Lanes 1 and 8 
contain DNA standards 
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DETAILED DESCRIPTION OF THE INVENTION: 

In the following description, trehalose-6-phosphate synthase 
5 (TPS) and trehalose-6-phosphate phosphatase (TPP) refer to 

catalytic activities , not to proteins, unless specifically * 
stated otherwise, whereas trehalose synthase refers to a 
protein that can covert uridine diphpsphoglucose_ (UDP_G)_and 
glucose-6-phosphate (G6P) into trehalose, and also exhibits as 

10 partial reactions TPS and TPP activities. TSS1, TSL1, and TSL2 
are structural genes that encode, repectively, the short (57 
kDa) and the about 130 and 99 kDa long chains of trehalose 
synthase. It is well known that mutations occur in genes and 
can cause changes in the amino acid sequence of the encoded 

15 polypeptide. Changes can also be introduced by genetic 

engineering techniques. As used herein, the term TSS1 (or TSL1 
or TSL2) gene includes all DNA sequences homologous with the 
sequences herein disclosed for TSS1 (or TSL1 or TSL2) and 
encoding polypeptides with the functional or structural 

20 properties of the 57 kDa (or about 130 kDa or 99 kDa, 

respectively) polypeptide. Sequences articially derived from 
these genes but still encoding polypeptides with the desired 
functional or structural properties are also included. 

25 The present inventors previously reported the isolation of a 
partially degraded protein preparation that contained a short 
(57 kDa) polypeptide chain and two fragments (86 and 93 kDa) of 
a long polypeptide chain and possessed both TPS and TPP 
catalytic activities [Londesborough, J & Vuorio, O. (1991) 

30 Journal of General Microbiology 137, 323-330]. The size of the 
full-length intact long chain, from which both the 86 and 93 
kDa fragments were then believed to be derived, and whether one 
or other polypeptide possessed one or oth x f of the catalytic 
activities were not known at that time. 

35 

The inventors have now isolated an undegraded trehalose 
synthase that contains the 57 kDa short chain , and two long 
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10 



chains of about 130 kDa and 99 kDa as its major polypeptides. 
Traces of other polypeptides are also present that appear to be 
degradation products of the about 130 and 99 kDa chains. Two 
genes, TSS1 and TSL1, that encode, respectively, the short and 
about 130 kDa long chains have been cloned and sequenced. 
Because the size of this long chain is now known from its gene 
to be 123 kDa, it is hereafter called the 123 kDa long chain. 
TSSl encodes a polypeptide with a theoretical molecular- weight 
of 56.2 kDa; however, this short chain and the 99 kDa long 
chain .are still called after their apparent molecular weights 
by SDS-PAGE analysis, the error in such analyses being at least 
± 10 kDa at 130 kDa. 

The sequences of TSSl and the polypeptide it encodes are 
15 disclosed as SEQ ID NOS:l and 2, respectively. The sequences of 
TSLl and the polypeptide it encodes are disclosed as SEQ ID 
NOS:83 and 82, respectively (earlier versions of these 
sequences, lacking the 5'- and N- terminal regions, are listed 
as SEQ ID N0S:3 and 4). Genetic evidence is disclosed that 
shows that a functional TSSl gene is involved in the expression 
of both TPS and TPP catalytic activities in S. cerevi.i.p . (1) 
both activities are absent from a mutant strain (Klg 102) that 
lacks a properly functional TSSl gene and does not express the 
short chain in a form recognizable in Western blots although it 
25 does express immunologically recognisable long chain; (2) 

disruption of TSSl eliminates TPS and TPP activities, abolishes 
the short chain signal from Western blots and prevents the 
accumulation of trehalose, and these defects are simultaneously 
reversed by transformation with TSSl, which also increased the 
30 resistance of the cells to freezing stress; and (3) 

transformation of Escherichia m n with TSSl causes a large 
increase in the TPS activity of the transformants (but no 
detected increase in their TPP activity) . 

35 We disclose biochemical evidence that the TPP catalytic 
activity of a truncated trehalose synthase requires a 
functional long chain: incorporation of about 1 mole of 



20 



WO 93/27093 



PCT/FI93/OO049 



18 

u C-N"-ethylmaleimide into the 93 kDa long chain fragment per 
mole of truncated trehalose synthase results in complete loss 
of TPP activity but only a slight loss of TPS activity. 
Furthermore, we have been able to isolate the 99 kDa 
5 polypeptide and show that it possesses residual TPP activity 
but no TPS activity. Also, intact trehalose synthase is 
partially resolved into a 99 kDa-enriched form with a 
relatively high TPP/TPS ratio and a 123 kDa-enriched form with 
a lower TPP/TPS ratio. However, truncation of the 123 kDa long 

10 chain has dramatic and important effects on the TPS activity of 
trehalose synthase: removal of the N-terminal 330 or so amino 
acids decreases the sensitivity of the TPS catalytic activity 
to inhibition by phosphate and almost eliminates its activation 
by fructose-6-phosphate. Further, transformation of E. coli 

15 with TSLl causes an increase in the TPS activity of the 

transformants (but no detected increase in their TPP activity) . 

Thus, both the short and the long chains make essential 
contributions to both the TPS and the TPP catalytic activities 
20 of trehalose synthase. The situation is therefore that there 
are at least two different structural genes for a trehalose 
synthase, neither of which can be completely described as the 
structural gene of either a trehalose-6-phosphate synthase 
protein or a trehalose-6-phosphate phosphatase protein. 

25 

We disclose that the amino acid sequences of peptides isolated 
from both the 86 and 93 kDa long chain fragments found in the 
truncated enzyme described by Londesborough & Vuorio [(1991) 
loc. cit] are encoded by TSLl. Surprisingly, however, none of 

30 the peptides isolated from the 99 kDa polypeptide in the intact 
enzyme is encoded by TSLl* Therefore, the structural genes 
encoding polypeptides of yeast trehalose synthase include a 
third member, TSL2 . The 99 kDa polypeptide encoded by TSL2 was 
not visible in SDS-PAGE analyses of truncated enzyme. However, 

35 ne of the 6 p ptides isolated from the 93 kDa fragment was not 
encoded by TSLl and had an amino acid sequence also found in a 
peptide isolated from the 99 kDa polypeptide. Thus, traces of a 
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degradation product of the 99 kDa polypeptide are present in 
truncated enzyme, and migrate with the 93 kDa fragment during 
SDS-PAGE. 

5 The inventors have not yet sequenced this third structural 
gene, TSL2, but disclose information that provides obvious 
methods for its isolation and cloning by a person ordinarily 
skilled in the art. Also a clone (pALK7756) comprising at least 
part of this gene has been deposited (Accession number , DSM 
10 7425; Deutsche Sammlung von Mikroorganismen und Zellkulturen, 
Mascheroder Weg 1 B, D-3300 Brauschweig) 

We disclose that the genes TSSl and TSL1 contain extensive 
similarities such that the amino acid sequence of the entire 
15 short chain is 37 % identical to residues 495 to 814 of the 
long chain. 

A novel feature of the present invention, therefore, is that in 
order to increase the capacity of a yeast or some other host 

20 • organism for trehalose synthesis it can be necessary to 

increase the expression of both the TSSl and the TSL1 and TSL2 
genes or modify these genes in some other way, * not because 
either TPS or TPP activity is "rate-limiting", but because more 
than one gene affects each activity. Thus, the results 

25 summarised above disclose that both TSSl and TSL1 affect TPS 
activity and both TSSl and TSL2 affect TPP activity . However, 
these results also disclose that the TSL2 gene product (the 99 
kDa polypeptide) isolated by chromatography is itself a 
trehalose-6-phosphatase whereas the TSSl gene product expressed 

30 in E. coli is a trehalose-6-phosphate synthase, although the 

catalytic efficiency of these separate polypeptides can be less 
than when they are correctly assembled in a trehalose synthase 
complex. 

35 A surprising finding was that the TSSl gene is identical with a 
gene variously called FDP1 or CIF1. This gene has pleiotropic 
effects on the utilization of sugars by S. cerevisiae . In 
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particular f haploid yeast bearing certain alleles of this gene 
(the so-called fdpl and cifl mutants) are unable to grow on 
mannose, or on mannose or sucrose , or on mannose, sucrose or 
fructose, or on mannose, sucrose, fructose or glucose, 
5 depending upon the severity of the defect [Van de Poll & 

Schamhart, (1977) Molecular and general Genetics 154 , 61*66; 
Banuelos, M. & Fraenkel, D.G. (1982) Molecular and Cellular 
Biology 2, 921-929]. Such mutants grow normally on galactose. 
Theref of e, during the selection of strains in which the TSS1 

10 gene has been deleted or modified it is sometimes essential and 
always advisable to grow the transformants on galactose, 
because in many cases the desired transformant will be unable 
to grow on any other common sugar, including the routinely used 
glucose. This is an unexpected methodological consideration 

15 that would not be obvious even to a person skilled in the art: 
special knowledge about the sequence and chromosomal, location 
of the TSS1 gene is required, which we now disclose'. 

Since our disclosure of the identity of TSS1 with FDP1 and CIF1 
20 in USPA 841,997, a confirmation has been published by Bell, W; r 
al- [(1992) European Journal of Biochemistry 209, 951-959)] 

The inventors' previous work [Londesborough & Vuorio (1991) 
loc. cit.] showed that the TPS catalytic activity of what is 

25 now known to be trehalose synthase requires a so-called 

TPS-Activator protein, which is a dimer of 58 kDa subuhits. We 
have identified this protein by the amino acid sequences of 
peptides it contains and by its catalytic activity and disclose 
that it is yeast phosphoglucoisomerase. We disclose that 

30 fructose-6-phosphate (F6P) , which could be made by phospho- 
glucoisomerase from the glucose-6-phosphate (G6P) in the assay 
mixtures used to measure TPS activity, is a powerful activator 
of the TPS activity of intact trehalose synthase. Also, when 
the assay mixture contains an equilibrium mixture of G6P and 

35 F6P the TPS-Activator has no further effect , so that its 

phosphoglucoisomerase activity is a complete explanation of the 
activation it causes. Furthermore, the TPS activity of 
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truncated trehalose synthase does not require F6P, and is not 
so strongly inhibited by phosphate as is that of the native 
enzyme. Thus, a trehalose synthetic pathway 'can in principle be 
transferred to any organism by transforming the organism with 
5 the structural genes for yeast trehalose synthase: it is not 
necessary to simultaneously introduce the TPS-activator, 
because F6P is a ubiquitous component of cells. Furthermore, if 
the amounts of F6P in an organism are inadequate, or phosphate 
concentrations are too high, the organism can be transformed 
10 with a truncated version of TSL1 encoding the truncated long 
chain that confers insensitivity to phosphate and F6P. This 
aspect of the present invention is particularly significant, 
because it both allows the introduction of a trehalose 
synthetic pathway to organisms in which the cytosolic phosphate 
and F6P concentrations would prevent the efficient function of 
yeast trehalose synthase, and also may permit trehalose 
synthase to function efficiently at stages of yeast growth -when 
native trehalose synthase would be inhibited by cytosolic 
phosphate. We disclose that intact trehalose synthase can be 
liberated from phosphate inhibition by treatment with trypsin 
in vitro . 

From the knowledge gained from the present invention, it is 
possible to produce trehalose recombinantly by transforming a 
host cell with the appropriately modified TSS1, TSL1 and TSL2 
genes. Methods of transformation and appropriate expression 
vectors are well-known in the art. 

Expression vectors are known in the art for both eukaryotic and 
prokaryotic systems, and the present invention contemplates use 
of both systems. For transformation of yeast at least two 
classes of promoters are contemplated. Yeast that accumulates 
more trehalose but at the usual time (i.e., after consumption - 
of fermentable carbon sources) can be made by inserting extra 
copies f the genes under their own promoters, or stronger 
promoters with similar control. Such yeast can have improved 
storage properties and stress resistance and be more economic 



WO 93/17093 



f 153/00049 



22 

sources of trehalose. Yeast that synthesizes trehalose during 
fermentation can be made by replacing the genes' own promoters 
with promoters (such as ADH1) that are active during 
fermentation. As explained above, such yeast can have increased 
5 fermentation rates, ethanol yields and resistance to osmotic 
and temper ature stress during fermentation. 

Also contemplated are modifications of the DNA sequence which 
would provide "preferred" codons for particular expression 

10 systems (e.g., bacteria and higher plants). In addition, the 
TSS1, TSL1 and TSL2 DNA sequences may be modified by certain 
deletions or insertions, provided the translated polypeptides 
are enzymatically functional. Expression of functional 
polypeptides from TSS1, TSL1 and TSL2 may be confirmed by 

15 assaying for TPS and/ or TPP activity in the expression system 
by the methods described in Londesborough and Vuorio [(1991) 
loc. cit. ] . Deletion of the first 330 amino acids or so from 
the 123 kDa long chain to give an enzyme active at higher 
phosphate and lower F6P concentrations has already been 

20 mentioned. 

The genes of the present invention may be transferred and 
expressed in plants by using the Ti plasmid system which is 
well known in the art. The internal transforming genes of a 

25 cloned T-DNA can be removed by recombinant DNA techniques and 

replaced by the genes of the present invention and expressed in 
plant tissues. Commonly, the coding sequence of the foreign 
gene (for instance, TSL1) is substituted for the coding region 
of the opine synthetase gene. In this way, the natural promoter 

30 and polyadenylation signals of the opine synthetase gene confer 
high-level expression of the foreign protein. Any method known 
in the art, however, may be used to transform higher plants 
with the genes of the present invention. 

35 The following examples are for illustration of the present 

invention and should not be construed as limiting the present 
invention in any manner. 
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EXAMPLES 

General Methods and Materials , 



Materials . Fructose 6-phosphate (F6P) and adenosine 5'- 
5 diphosphoglucose (ADPG) were from Sigma Chemicals. Glucose 
6-phosphate (G6P) , phenylmethylsulphonyl fluoride (PMSF) , 
uridine 5 ' -diphosphoglucose (UDPG) and other commercial 
reagents were from the sources stated in Londesborough & Vuorio 
[(1991) loc. cit.]. Truncated trehalose synthase (proteolytic- 

10 ally activated "TPS/P") and TPS activator were prepared as 
described in Londesborough & Vuorio [(1991) loc. cit.]. The 
antisera, anti-TFS/P, anti-57K and anti-93K were made in 
rabbits using as antigen, respectively, truncated trehalose 
synthase, the short (57 kDa) chain and the 93 kDa fragment of 

15 the long chain of trehalose synthase as described in 
Londesborough & Vuorio [(1991) loc. cit.]. 

Buffers for enzvme extra ction and purification . Two standard 
cocktails, HBMED (25 mM Hepes/KOH pH 7.0/1 mM benzamidine/1 mM 

20 MgCl 2 /0.1 mM EDTA/1 mM dithiothreitol) and HB2M1ED (HBMED but 
with final concentrations of 2 mM MgCl 2 and 1 mM EDTA) were 
used as basal buffers during preparation of cell extracts and 
purification of enzyme. Where indicated, the Hepes and 
benzamidine concentrations were increased to 50 mM and 5 mM, 

25 respectively. 

Yeasts . Commercial baker's yeast was from Alko's Rajamaki 
factory. The standard laboratory strains of S. cerevisiae used 
were X2180 (ATCC 26109) and S288C (ATCC 26108) . Mutant strains 
are described in the Examples and Table 1 lists important 
0 strains of microorganisms and plasmids. Laboratory yeast were 

routinely grown on 1 % yeast extract/ 2% peptone (YP) containing 
the indicated carbon source in aerobic shake flasks at 30 °C 
and 200 r.p.m. Cells were harvested by centrifugation for 5 
minutes at 3000 g, resuspended in distilled water and again 
5 c ntrifuged 5 minutes at 3000 g. The pellets were suspended in 
about 20 volumes of HB2M1ED and centrifuged in tared tubes for 
10 minutes at 15,000 g. Tubes and pellets were weighed to give 



WO 93/17093 



PCT/FI93/00049 



24 

the mass of fresh yeast. For trehalose determinations , portions 
of the pellets were treated as described by Lillie, S.H. & 
Pringle, J.R. [(1980) Journal of Bacteriology 143, 1384-1394]. 
The washed cells were broken by suspending them at 0 °C in 1 to $ 
5 4 volumes of HB2M1ED, adding fresh stock PMSF/pepstatin (1 mg 

pepstatin A/ml 0.1 M PMSF in methanol) to give final * 
concentrations of 10 fxg pepstatin/ml and 1 mM PMSF, and shaking 

with^rl ass bead s for t hree 1 minute peri ods in a Braun MK II 

homogenizer or (for amounts less than 0.3 g fresh yeast) by 

10 vortexing in an Eppendorf tube. The glass beads were removed 

and the volume of homogenate was measured. Samples for SDS-PAGE 
were made at once by dilution with Laeinmli sample buffer 
[Laemmli, U.K. (1970) Nature, London 227 , 680-685]. The 
homogenates were then centrifuged as indicated (usually 5 min 

15 at 5,000 g or 20 minutes at 28,000 g) . Enzyme assays were made 
on the homogenates and supernatants and protein determined in 
the supernatants from A280 and A260 measurements. 

20 . Table 1. List of important strains and plasmids 

Name Description Source 

Saccfraromyces cerevisiae 

25 

X2180 (ATCC 26109) Standard laboratory yeast (diploid) 

S288C (ATCC 26108) Standard laboratory yeast (haploid) 

30 Klg 102 cif 1-102, leul, ural, trp5, MATa 1 

MV6807 fdpl, leu2, ura3, his3, lys2, ade8, 2 

trpl, MATa § 

35 S150-2B leu2, his3, trpl, ura3, Mata 



ALK03569 



tssl::LEU2 (from S150-2B) 



This work 
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tssl::LEU2 (from S150-2B) 
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This work 



WDC-3A 



cifl::HIS3 f his3 , ura3, ade2 f M&Ta 



Escherichia coli 



HB101 (ALKO 683) 



10 ALK03566 



HB101 containing pALK752 



This work 



ALK03568 



HB101 containing pALK754 



This work 



15 



Plasntids 

PALK751 (DSM 6928) pBluescript containing an 8.-2 kb This work 

insert comprising TSL1 



20 



PALK752 



pBluescript containing a 2.5kb This work 
insert comprising TSS1 



PALK753 



pBluescript containing a 3.3 kb This work 
insert comprising the ORF of TSS1 



15 pALK754 



pBluescript containing a 4.4 kb This work 
insert comprising TSL1 



10 



PALK756 (DSM 7425) pBluescript containing a 3.5 kb This work 

insert comprising at least part 
of TSL2 



PALK757 



pBluescript containing an This work 

insert comprising the ORF of TSL1 



pMB14 



YEp352 containing CIF1 
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Sources: 1. Dr. D. Fraenkel, Harvard Medical School, U.S.A. 

2. Dr. J. Thevelein, Lab voor Plantenbioch. , Heverlee, 
Belgium. 

3. Dr. c. Gancedo, CSIC, Madrid, Spain. 



_ Enzv^e As says . TP P and TPS st andard assa ys a nd o ther kinetic 
measurements were made as described by Londes borough & Vuorio 
10 [(1991) loc. cit.] except that the standard TPS assay mixture 
contained 5 mM F6P unless stated otherwise. Where appropriate, 
TPS assays were corrected by measuring UDP production from UDPG 
in the absence of G6P and F6P. 

15 DNA manipulations . Stratagene's (La Jolla, California) 

Escherichia coli strain XL-1 Blue {recAl, endAl, gyrA96, thi, 
hsdR17, supE44," relAl, lac, [F' proAB, laclg ADM15, TnlO 
(tetR)]> were used as host bacteria. When needed, XL-1 Blue 
cells were made competent by the method of Handel & Higa (1970) 

20 Journal of Molecular Biology 53, 159-162]. The cloning vector 
was Stratagene's Lambda Zap II, predigested with EcoRI, where 
the cloning site is near the N-terminus of the gene for 
B-galactosidase, thus enabling the color selection of 
recombinant clones. The seguencing vectors M13mpl8 and M13mpl9 

25 from Pharmacia LKB Biotechnology were also used. 

High molecular mass DNA from the haploid S288C strain was 
prepared as described Johnston, J.R. [(1988) in Yeast, A 
Practical Approach, XRL Press, Oxford] and partially digested 

30 with either HaelXI or EcoRI restriction enzyme. For the large 
scale Haerir digestion, e.g., a reaction mixture of 330 jtil 
containing 30 fig of DNA and 4.8 U of enzyme was incubated at 37 
°C for 60 minutes. The reaction was stopped with 10 jlcI of 0.5 M 
EDTA and transferred to ice. The methods for such digestions 

35 and their agarose gel electrophoretic analysis are well known 
in the art and are described, e.g., in Sambrook et al. . 
Molecular Cloning, A Laboratory Manual [Cold Spring Harbor 
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Laboratory Press, 2nd ed. , (1989)]. 

Plasmid DNA was isolated using standard methods for small scale 
purification Sambrook et al . [(1989) Molecular Cloning, A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory 
Press, hereby expressly incorporated by reference]. Large scale 
purifications of plasmid DNA were done with Qiagen tip-100 
columns from Diagen following their instructions. 



DNA sequences were determined either manually by the dideoxy- 
chain termination method [Sanger et al. (1977) Proceedings of 
the National Academy of Sciences U.S.A. 74, 5463-5467], 
sequencing directly from pBluescript plasmids, or automatically 
with the Applied Biosystems Model 373A automatic DNA sequencer, 
sequencing either directly from these plasmids or from M13 
subclones. ' 

Southern and Western hybridizations and other standard 
manipulations were carried out by well known procedures [see, 
e.g., Sambrook et al. (1989) loc. cit. ] . 

Example 1. Purification of intact trehalose synthase 

Intact trehalose synthase was purified from commercial baker's 
yeast. The method described by Londesborough & Vuorio [(1991) 
loc. cit.] for purification of "proteolytically activated 
TPS/P" was modified as follows: 

1. All buffers contained 2 raM MgCl 2 and 1 mM EDTA. This 
increased yields in the early steps and probably helped to 
decrease proteolysis in the later steps. 

2. In the first ammonium sulphate fractionation, the EDTA 
concentration was increased to 2.5 mM before addition of 
ammonium sulphate. 

3. All buffers were adjusted to between 0.4 and 1 mM PMSF and 
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between 4 and 10 jzg pepstatin A/ml by addition, immediately 
before use, of the appropriate amount of a freshly prepared 
stock solution containing 1 mg pepstatin A/ml 0.1 M PMSF in 
methanol (called, stock PMSF/pepstatin) . When, as in 
5 chromatography, buffers were used for several hours, more stock 
PMSF/pepstatin was added at intervals, but so as not to exceed 

1.5 % methanol in the buffer, or a fresh lot of buffer was 
taken into use, because of the short half-life of PMSF in 
aqueous solution. All columns were equilibrated with at least 

10 one bed volume of buffer containing PMSF and pepstatin A 
immediately before application of enzyme. 

4. Experience permitted the enzyme-containing fractions (a 
total of 17 ,8 ml in the preparation of Table 2) from 

15 Heparin-Sepharose to be identified as soon as they were eluted. 
Stock PMSF/pepstatin (150 fil) and 0.1 M EDTA (200 fil) were 
immediately added to them. Then 7.2 g of powdered ammonium 
sulphate was slowly added (over 20 min) . After 3 0 min 
equilibration, the mixture was centrifuged 15 min at 28,000 g- 

20 The pellets were packed for 5 min at 28,000 g and expressed 
buffer was removed with a pasteur pipette. The pellets were 
dissolved to 2.0 ml in HB2M1ED containing 0.8 mM PMSF and 8>g 
pepstatin A/ml, centrifuged 5 min .at. 28,000 g and applied to a 

2.6 x 34 cm column of Sepharose 6B freshly equilibrated with 
25 HB2M1ED containing 50 mM NaCl, 0.4 mM PMSF and 4 fig pepstatin 

A/ml. The interval between elution from Heparin-Sepharose and 
application to Sepharose 6B was 5 h. In the Londesborough & 
Vuorio [(1991) loc. cit.] procedure, the Heparin-Sepharose 
eluates were stored at about 3 °C, without addition of PMSF or 
30 pepstatin A, for 5 days before the second ammonium sulphate 
fractionation and application to Sepharose 6B. 

5. Fractions (3.7 ml) from the Sepharose 6B column were 
immediately mixed with 20 /il of stock PMSF/pepstatin and then 

35 assayed. Again, experience permitted the correct fractions to 
be pooled, based on activity and A280 measurements without 
SDS-PAGE analysis, and immediately applied to a 0.7 x 7 cm 
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column of UDP-Glucuronate-Agarose equilibrated with HB2M1ED 
containing 50 roM NaCl, 0.4 mM PMSF and 4 jug pepstatin A/ml. The 
enzyme was e luted as described by Londes borough & Vuorio 
[(1991) loc. cit.] and 10 ul of stock PMSF/pepstatin added to 
5 each 1.7 ml fraction. Each fraction was divided into three. Two 
portions were stored at -70 °C and one at 0 °C. 

Table 2 summarizes a P^i^ication and Fig. 1 shows the SDS-PAGE 
analysis of fractions eluted from UDP-Glucuronate-Agarose. No 
0 obvious differences were apparent between enzyme eluted by 0.2 
M Naci and that eluted by 10 mM UDPG/0.4 M NaCl. The major 
bands present had molecular masses of 57, 99 and 123 kDa. 
Several weaker bands were present between the 123 kDa band and 
about 90 kDa. In Western analyses the 123 kDa, 99 kDa and most, 
5 if not all, of the fainter bands in this region were recognized 
by the anti-TPS/P and anti-93K sera. This suggests that the 
fainter bands are partially degraded long chains. The weak 
bands at 68 kDa also reacted with the anti-93K serum, but could 
be removed by chromatography on DEAE-cellulose (see Example 9) . 
) When the antibodies from anti-93K serum that bound to the 99 
kDa band were eluted from a nitrocellulose blot [as described 
by Pringle, \T.R. (1991) Methods in Enzymolgy 194, 565-590] and 
used to probe another blot, they bound also to the 123 kDa 
band, showing that the two long chains of trehalose synthase 
25 have epitopes in common. 

Intact enzyme binds less tightly to the UDP-Glucuronate-Agarose 
than the truncated enzyme purified by Londesborough & Vuorio 
[(1991) loc. cit.] and the proportion of enzyme remaining bound 
at 0.2 M NaCl varied from preparation to preparation. When #9 
of Table 2 was re-run on the same column, 76 % of the TPS 
activity was again recovered at 0.2 M NaCl (and 25 % by 0.4 M 
NaCl/10 mM UDPG) , so that overloading of the column is not the 
reason why this enzyme eluted at 0.2 M NaCl. However, when 
enzyme eluted at 0.2 M NaCl was truncated with trypsin as 
described in Example 10, it then bound to the column at 0.2 M 
NaCl and was only recovered at 0.4 M NaCl/ 10 mM UDPG. Thus, as 
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well as altering the kinetic properties of the enzyme (see 
Examples 10 & 12), this truncation also increases the affinity 
for UDP-Glucuronate-Agarose. Presumably there are subtle 
differences in factors such as the amount of adventitious 
5 proteolysis and state of aggregation between enzyme eluted at 
0.2 M NaCl and that remaining bound. For the preparation 
summarised in Table 2, the ratio of standard TPP and standard 
TPS activities increased from 22 % in #9 to 39 % in #14, 
showing that there are differences, even though they could not 
10 be clearly detected by SDS-PAGE. 

These findings disclose that a highly purified trehalose 
synthase containing a 57 kDa short chain, a 123 JcDa long chain 
and a 99 kDa polypeptide that is recognised by the anti-93K 

15 serum possesses both TPS activity activatable by TPSActivator 
protein (or F6P) and TPP activity. The rate of * hydrolysis of 1 
mM G6P in either phosphate or Hepes buffer was less than 1 % of 
that of 0.5 mM trehalose-6-phosphate, so that the TPP activity 
is highly specific. An unexpected finding is that this highly 

20 purified preparation contains the 99 JcDa polypeptide, which is 
not present in the purified truncated trehalose synthase. It is 
disclosed later that this polypeptide is not a degradation 
product of the long (123 kDa) chain, whereas both the 86 and 93 
kDa polypeptides of truncated enzyme contain amino acid 

25 sequences that identify them as fragments of the long (123 kDa) 
chain. This novel preparation possesses some unexpected 
catalytic properties, which are described in more detail in 
Example 11. 

30 Table 2. Purification of intact trehalose synthase 

The preparation is from 60 g of pressed baker's yeast. TPS 
activities "Without Activator" were measured as described by 
Londesborough & Vuorio [(1991) loc. cit.], i.e., in the absence 
35 of F6P. Assays "With Activator" were determined similarly but 
in the presence of a saturating amount of pure TPS activator 
(similar values were obtained when some fractions were later 
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assayed in th presence of 5 mM F6P instead of TPS activator, 
and are shown in parentheses) * ND, not determined. 



Fraction 



Volume Without Activator 
(ml) U/rol U/mq Total U 



With Activator 
U/nil U/mcr Total U 



1st (NH 4 ) 2 S0 4 Precipitate 


13.4 


58 


1.0 


810 


G25 eluate 


22.2 


30 


1.1 


668 


Heparin-Sepharose eluate 


18.2 


ND 


ND 


ND 


Sepharose 6B eluate 


26 


1.4 


5.1 


36 


ODP-glucuronate agarose eluates 


• 








-at.-0-2-M-NaCl- 










# 9 


1.7 


4.6 


3.1 




# 10 


1.7 


WD 


ND 


ND 


# 11 


1.7 


ND 


ND 


ND 


# 12 


1.7 


ND 


ND 


ND 


at 0.4 M NaCl/10 mM DDPG 










# 13 


1.7 


2.1 






# 14 


1.7 


3.7 


27 b 




# 15 


1.7 


ND 







ND ND 
ND ND 
*21 -11 
4.7 17 



ND*> 
ND 
»380 
121 



11.5(12) 12 

12.2 21 

6.3 23 58 

3.9(3.3) 22 

5.9(6,2) 25-30 a 

9.3 25-30 a 

0.8 



a Based on protein .contents estimated from Coomassie blue- stained SDS-PAGE 
gels 

b Results from other preparations show that the activity with 
excess TPS-activator (or 5 mM F6P) is not, at this step, more 
than 10 % greater than that without activator. 

Example 2 Increased expre ssion bv S. cerevisiae of the long and 
short chains of trehalo se synthase after consumption of glucose 

Three 500 ml lots of YP/2 % glucose in 1 1 shake flasks were 
each inoculated with 1 ml of a suspension of X2180 cells of 
A600 1.0 and shaken at 200 r.p.m. at 30 °C. At the times shown 
in Table 3, the cells were harvested, broken and analyzed as 
described in General Materials and Methods. The 28,000 g 
supernatant s were stored for a week at -18 °c, thawed and 
re-centrifuged for 20 min at 28,000 g. Portions of 150 pi (each 
equivalent to 53 mg of fresh yeast) were mixed with 30 /xl of 
anti-TPS/P serum, equilibrated for 30 min at 0 °C and 
centrifuged for 10 min at 10,000 g. The pellets were washed 
with 250 Ml of HBMED and then dissolved in Laeiranli sample 
buffer and subjected to SDS-PAGE (Fig 2). Bands at 57, 99 and 
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123 kDa were strong in the sample (C) from stationary phase 
yeast and in the sample (B) harvested immediately after 
disappearance of glucose from the medium, but were absent or 
very weak in the sample (A) from yeast growing in the presence 
5 of 1.2 % glucose. 

Table 3. Appearance of TPS and TPP activities in X2180 yeast 
grown on YP/2 % glucose. 

_ .-£0 Enzymes were -assayed in~the -28y000~ g supernatants . 





A 


B 


C 


Age (h.) 


16.1 


18.1 


39.0 


Residual glucose 


1.2 


50.001 


50.001 


(g/100 ml medium) 








Fresh, yeast mass 


7.6 


14.8 


29.5 


(mg/ml medium) 








Trehalose 


0.73 


3.1 


94 


(mg/g dry yeast) 








TPS (U/g fresh yeast) 


1.2 


7.4 


10.5 


TPP (U/g fresh yeast) 


0.29 


2'. 2 


3.0 


TPP/TPS (%) 


24 


30 


29 



Control experiments (not shown) indicated that pre- immune serum 
did not precipitate the 57, 99 and 123 kDa bands, and that 
using 50 fil of serum instead of 30 Ml did not precipitate more 
30 of these three bands from the C sample. 

These results disclose that the co-ordinate, 7-fold increase in 
TPS and TPP activities that occurs during less than 2 h when 
glucose disappears from the medium is accompanied by increases 

35 in the amounts in yeast of three polypeptides, of mass 57, 99 
and 123 JcDa, that are immunoprecipitated by anti-TPS/P serum. 
These polypeptides are those found in the intact trehalose 
synthase purified in Example l. Thus, increase in the amount of 
enzyme protein is a major mechanism by which the capacity of 

40 yeast to synthesize trehalose is increased- 
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Example 3 Determination of t he N-terminal amino acid sequences 
of peptides isolated from the va rious polypeptides of trehalose 
synthase 

The 57, 86 and 93 kDa polypeptides of the truncated trehalose 
synthase were separated by SDS-PAGE, digested on nitro- 
cellulose blots and fractionated by HPLC as described by 
Londesborough & Vuorio [(1991) loc. cit.]. Also, these 
polypeptides and polypeptides of molecular mass 57, 99 and 123 
kDa immunoprecipitated from yeast extracts as described in 
Example 2 were separated by SDS-PAGE and digested in the gel 
with lysylendopeptidase C as described by Kawasaki, H., Emori, 
Y. and Suzuki, K. (in press). The derived peptides were 
separated by HPLC using a DEAE pre-column before the reverse- 
phase column essentially as described by Kawasaki et al [(1990) 
Analytical Biochemistry 186, 264-268]. The 99 kDa polypeptide 
isolated by chromatography on phosphocellulose in the absence 
of triton (see Example 9) was digested with lysylendopeptidase 
C and the peptides separated by HPLC. In all cases, isolated 
peptides were sequenced in a gas-pulsed liquid phase sequencer 
as described by Kalkinen, N. & Tilgman, C [ (1988) journal of 
Protein Chemistry 7, 242-243], the released PTH-amino acids 
being analysed by on-line, narrow-bore, reverse-phase HPLC. The 
sequences are shown in Table 4. 

Table 4. N-terminal amino acid se quences of peptides isolated 
frpm (fragments of) the polypeptides of trehalose synthase. 

When two sequences were obtained from the same HPLC peak, they 
are shown as a and b sequences, where possible according to the 
sequences predicted from the genes. Tentative identifications 
from the amino acid sequencer are shown by the one letter codes 
followed by double queries. Unidentified residues are shown by 
Xaa. (In the Sequence Listings, also tentatively identified 
residues are indicated as Xaa) . The location of each amino acid 
sequence in the short (S) and long (123 kDa) (L) chains of Figs 
3b and 4b is shown below the sequence. 
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Short (57) chain peptides 

Tryptic peptides from blots 'of the 57 kDa polypeptide from 
truncated trehalose synthase . 

5 

848 Tyr-Ile-Ser-Lys 

(SEQ ID NO:5) (S 463-66) 

850 Asp-Val-Glu-Glu-Tyr-Gln-Tyr-Leu-Arg 
10 (SEQ ID NO: 6) (S 333-41) 

859 His-Phe-Leu-Ser-Ser-Val-Gln-Arg 
(SEQ ID N0:7) (S 223-30) 

15 862a Val-Leu-Asn-Val-Asn-Thr-Leu-Pro-Asn-Gly-Val-Glu- 
Tyr-Gln 
(SEQ ID N0:8) (S 231-44) 

862b Ser-Val-Val-Asn-Glu-Leu-Val-Gly-Arg 
20 (SEQ ID N0:9) (S 342-50) 

863 Leu-Tyr-Lys 

(S 460-2) 

25 864 Glu-Thr-Phe-Lys 

(SEQ ID N0:10) (S 280-3) 

866 Leu-Asp-Tyr-Ile-Lys 

(SEQ ID NO: 11) (S 294-8) 



30 



870 I le-Leu-Pro-Val-Arg 

(SEQ ID N0:12) (S 196-200) 
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From lysylendopeptidase C digests of immunoprecipitated 57 kDa 
band 

t 

966a Glu-Val-Asn-Xaa-Glu-Lys 

(SEQ ID NO: 13) (S 454-9) 



966b Phe-Tyr-Asp-Xaa-L?? 

(SEQ ID NO: 14) (not found) 



980 Leu-Xaa-Ala-Met-Glu-Val-Phe-Leu-Asn-Glu-Xaa-Pro-Glu 
(SEQ ID NO: 15) (S 304-16) 

981 Tyr-Thr-Ser-Ala-Phe-Trp-Gly-Glu-Asn-Phe-VaL-Xaa- 
Glu-Leu 

(SEQ ID NO: 16) (S 467-80) 

987 Phe-Gly-Xaa -Pro-Gly-Leu-Glu-I le-Pro . 

(SEQ ID NO: 17) (S 63-71) 

Long (123 kDal chain peptides 

Trvptic peptides from blots of the 8 6 and 93 kDa fragments . 

889 D??-Gly-Ser-Val-Met-Gln 

(SEQ ID NO: 18) (L 587-592) 

890/891 Leu -Pro-G ly-S er-Tyr-Tyr-Ly s 
(SEQ ID NO: 19) (L 917-23) 

892a Ala-Ile-Val-Val-Asn-Pro-Met-Asp-Ser-Val-Aia 
(SEQ ID NO: 20) (see peptide 1299) 



892b 



Met-Ile-Ser-Ile-Leu 

(SEQ ID NO: 21) (L 842-7) 
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From lysvlendopeptidase digest of combined 86 and 93 kDa 
fragments ■ 

117 1 Ar g-Arg-Pr o-Gln-Trp-Ly s 

(SEQ ID N0:22) (L 770-5) 



From lvsvlendooeptidase diges t of the 86 kDa fragment. 

10 1479 Thr-Leu-Met-Glu-Asp-Tyr-Gln-Ser-Ser-Lys 
(SEQ ID NO: 52) (L 816-26) 



1483a Ala-Phe-Glu-Asp-His-Ser-Trp-Lys 
(SEQ ID N0:78) (L 445-52) 

1483b Ala-Gly-His-Ala-Ile-Val-Tyr-Gly-Asp-Ala-Thr-Ser-Thr- 
Tyr-Ala-Lys . 



15 



(SEQ ID NO: 79) (L 1064-79) 

2 0 148 1 Glu-Arg-Leu-Pro-Gly-Ser-Tyr-Tyr-Lys 

(SEQ ID NO 80) (L 914-23) 

From lvsylendopeptidase digest of the 93 kDa fragment. 

25 1480 Thr-Leu-Met-Glu-Asp-Tyr-Gln 

(SEQ ID N0:81) (L 816-23) 



30 



1484a Ala-Phe-Glu-Asp-His-Ser-Trp-Lys 
(SEQ ID NO: 78) (L 445-52) 

1484b Ala-Gly-His-Ala-Ile-Val-Tyr-Gly-Asp-Ala-Thr-Ser- 
Thr-Tyr-Ala-Lys 
(SEQ ID NO: 79) (L 1064-79) 



35 1485 Glu-Arg-Leu-Pro-Gly-Ser-Tyr-Tyr-Lys 

(SEQ ID NO:80) (L 914-23) 
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From lvsylendopeptidase digests of immunoprecipitated 124 kDa 
band 

1047 Ser-D??-Pro-Gln-Lys 

(SEQ ID NO: 23) (not found) 

1048 Phe-Tyr-Arg-Asn-Leu-Asn-Gln-Arg-Phe-Ala-Asp-Ala- 
Ile-Val-Lys 

(SEQ ID NO: 24) (L 453-67) 

1054a Asp-Gly-Ser-Val-Met-Gln-W??-Xaa-Gln-Leu-I?? 
(SEQ ID NO: 25) (L 587-97) 

1054b Asn-Ala-Ile-Asn-Thr-Ala-Val-Leu-Glu-Asn-Ile-Ile- 
Pro-H??-Xaa-H??-Val-Lys 
(SEQ ID N0:26) (L 360-77) 

1061 Leu-Val-Asn-Asp-Glu-Ala-Ser-Glu-Gly-Gln-Val-Lys 
(SEQ ID NO: 27) (L 1052-63) 

1063 V??-Gln-Asp-Ile-Leu-Leu-Asn-Asn-Thr-Phe-N?? 
. (SEQ ID NO: 28) (not found) 

1375 Phe-Leu-Val-Glu-Asn-Pro-Glu-Tyr-Val-Glu-Lys 
(SEQ ID NO:50) (L 629-39) 

1376 R??-Ile-Thr-Pro-His-Leu-Thr-Ala-Xaa-Ala-Ala 
(SEQ ID NO: 51) (L 245-55) 

1377 Thr-Leu-Met-Glu-Asp-Tyr-Gln-Ser-Ser-Lys 
(SEQ ID NO:52) (L 816-26) 

1378-1 Ile-Leu-Glu-Gly-Leu-Thr-Gly-Ala-Asp-Phe-Val-Gly- 
Phe-Gln-Thr 

(SEQ ID NO: 53) (L 521-35) 
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1378-11 Gln-Ile-Leu-Xaa-Pro-Thr-Leu-Xaa-Tyr-Gln-Ile-Pro- 
Asp-Asn 
(SEQ ID NO: 54) (L 427-40) 

5 1380 Phe-Gly-Gly-Tyr-Ser-Asn-Lys 
(SEQ ID NO: 55) (L 319-25) 



10 



-1-3 81 Phe-Xaa-Thr-Glu-Asn-Ala-Glu-Asp-Gln-Asp-Xaa-Val- 

Ala-Xaa-Val-Ile-Gly-G??-Ala-Ile-Xaa-Xaa-Ile 
(SEQ ID NO: 56) (L 931-53) 



15 



1382 Xaa-Val-Gly-Thr-Val-Gly-Ile-Pro-Tlir-Asp-Glu-Ile- 
Pro-Glu-Asn-Ile-Leu-Ala 
CSEQ ID NO: 57) (L 378-95). 

The 99 kDa polypeptide 



20 



From lysvlendopeptidase digests of immunoprecipitated 99 kDa 
band 



959 



Asp-Thr-Thr-Gln-Tlir-Ala-Pro-Val-T??-Asn-Asn-Val- 
Xaa-Pro 
(SEQ ID NO: 29) 



25 961 



Asn-Gln-Leu-Asp-Ala-A??-Asn-Tyr-Ala-Glu-Val 
(SEQ ID NO: 30) 



30 



1002a Asn-Leu-Ser-Arg-Trp-Arg-Asn-Tyr-Ala-Glu 
(SEQ ID NO: 31) 

1002b Trp -Gln-Gly-Lys 

(SEQ ID NO: 32) 



1043 

35 



Ile-Gln-Leu-Gly-Glu-Ser-Asn-Asp-Asp-D??-L?? 
(SEQ ID NO: 33) 
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Glu-Val-Pro-Thr-Ile-Gln-Asp-Xaa-Thr~Asn-Lys 
(SEQ ID NO: 34) 

Xaa-Tyr-Xaa-Tyr-Val-Lys 
(SEQ ID NO: 35) 

Asn-Gln-Leu-Gly-Asn-Tyr 

— (-SEQ— I-D— NO-:-3 6-)- 

Va 1- Al a -Leu-Thr 
(SEQ ID NO: 37) 

Asp-Ala~Ile-Val-Val-Asn-Pro-Xaa-Asp-Ser«-Val-Ala 
(SEQ ID NO: 38) 

' Ser-Leu-Leu-Asp-Ala-Gly-Ala-Lys 
(SEQ ID NO: 44) 

Glu-Lys-Pro-Gln-Asp-Leu-Asp-Asp-Asp-Pro^Leu-Tyr- 
Leu-Thr 
(SEQ ID NO: 45) 

D??-Gln-Xaa-His-Gln-Asp-Xaa-Xaa-Asn-Leu-Thr 
(SEQ ID NO: 46) 

Phe-Asn-Asp-Glu-Ser-Ile-Ile-Ile-Gly-Tyr-Phe-P??- 
Xaa-Ala-Pro 
(SEQ ID NO: 47) 

Ser-Arg-Leu-Phe-Leu-Phe-Asp-Tyr-Asp-Gly-Thr-Leu- 
Thr-Pro 
(SEQ ID NO: 48) 
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From lvsvlendonentidase digest of 99 kDa protein purified on 
pfaosphocellulose , 

1451 Gln-Leu-Gly-Asn-Tyr-Gly-Phe-Tyr-Pro-Val-Tyr 
5 (SEQ ID NO:49) 



Apart from peptide 966b, all the amino acid sequences 

10 determined from the short chain samples have been located in 
the protein sequence deduced from the TSS1 gene (see Figure 
3b). Apart from peptides 892a, 1047 and 1063, all the amino 
acid sequences determined from the 86 and 93 kDa fragments of 
the long chain and from the intact 123 kDa long chain itself 

15 have been located in the protein sequence deduced from TSL1. 

The HPLC profiles obtained from digests of the 86 kDa fragment 
were essentially identical with those from digests of the 93 
kDa fragment when either trypsin or lysylendopeptidase C was 
used (not shown) • Also, corresponding HPLC peaks from 86 and 93 

20 kDa digests yielded the same sequences or double sequences 

(peptide pairs 890 & 891; 1479 & 1480; 1483a, b & 1484a, b; 1481 
& 1485) . These results disclose that both the 86 and 93 kDa 
polypeptides in truncated enzyme are derived from the 123 kDa 
long chain encoded by TSL1. In particular, it is not the case 

25 that one or other of these fragments is derived from the 99 kDa 
polypeptide, although contamination with minor amounts of 
(degradation products of) that polypeptide is probable (see 
below) . 

0 None of the 16 amino acid sequences obtained from the 99 kDa 

polypeptide is encoded by TSL1. The first 5 residues of peptide 
1451 from the 99 kDa polypeptide purified on phosphocellulose 
are identical with the last 5 residues of peptide 1297a from 
immunoprecipitated 99 kDa polypeptide. This confirms that the 

5 99 kDa polypeptide immunoprecipitated by anti-TPS/P serum from 
yeast extracts is the same as the 99 kDa polypeptide in 
purified intact enzyme. These results disclose that the 99 kDa 
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polypeptide is not encoded by TSL1 (or TSS1) but by another 
gene, which the inventors call TSL2. 

The origin of peptides 1047 and 1067 found in the digest of the 
5 intact (123 kDa) long chain is not known. The only peptide from 
the long chain fragments of truncated enzyme not encoded by 
TSL1 is 892a from the 93 kDa fragment. This is identical with 
th e last 11 resid ues of pep tide 129 9 from the 99 kDa 
polypeptide. This suggests that the 93 kDa band was 

10 contaminated with some material derived from the 99 kDa 

polypeptide, although this polypeptide itself was not visible 
in SDS-PAGE analyses of the truncated enzyme. The identical 
HPLC profiles of digests of the 86 and 93 kDa fragments and the 
fact that only one peptide derived from the 99 kDa polypeptide 

15 was identified in these digests shows that the contamination 

was at a low level. This discloses that a functional truncated 
trehalose • synthase with both TPS and TPP activities probably 
requires only polypeptides encoded by TSS1 and TSL1. 

20 Example 4 Cloning and sequencing of TSSl 

(a) Preparation and screening of a yeast genomic DNA library 

A genomic library was constructed in the bacteriophage lambda 
vector, Lambda Zap II, using a partial Haelll digest of 

25 cerevisiae strain S288C chromosomal DNA, according to 

Stratagene' s Instruction Manual for the Zap-cDNA synthesis kit. 
The DNA from the ligation reaction was packaged into Giga II 
Gold packaging extract (Stratagene) according to the 
manufacturer's instructions (1990). The titer of the 
0 recombinants was determined on Luria broth plates containing 
X-B-galactoside (5-bromo-4-chloro-3-indoyl-B-D-galacto- 
pyranoside) as a chromogenic substrate for 6-galactosidase and 
IPTG (isopropyl 6-D-thiogalactopyranoside) as an inducer. 
About 50,000 recombinants were amplified on large (150 mm) 

5 NZY-plates according to Stratagene 's instructions. The titre of 
the resulting library was 5 x 10 9 pfu/ml with a total of 150 
ml. 
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Several positive clones were found by screening with anti-TPS/P 
serum* After three rounds of purification, all clones were 
positive. They were screened again, now with anti-57K serum. 

5 For further manipulations of DNA, the plasmid part, 

pBluescript, of the Lambda Zap vector was excised as described 
in the manual for Predigested Lambda ZapII/EcoRl Cloning Kit 
(1989) i . 

10 (b) Sequencing of TSS1 

A strongly positive clone from the Lambda ZapII library was 
selected and sequenced manually. The sequence obtained included 
an open reading frame that encoded a 58 kDa protein, but none 
15 of the short chain peptide sequences disclosed in Example 3 was 
found in the amino acid sequence encoded by this ORF. 

Therefore, a second clone was selected, from a group of clones 
that gave distinct restriction maps compared with the group 

20 - including the first clone. It also responded less strongly to 
anti-57K serum, which is why it was not chosen in the first 
place. It was sequenced using the Exonuclease XII/Mung Bean 
nuclease system for producing series of unidirectional 
deletions. The deletions were prepared according to 

25 Stratagene's manual for the pBluescript Exo/Mung UNA sequencing 
system. The plasmid was first digested with the restriction 
enzymes SacI, which leaves a 3' overhang, and BamHI, which 
leaves a 5' overhang. For filling in possible recessed 
3 'termini created by Mung Bean nuclease, 2.5 pi of 10X 

30 nick-translation buffer, 1 pi of dNTP (a mixture of all four 
dNTPs, each at 2 mM) and 1 pi (2U) of Klenow fragment were 
added. The reaction proceeded for 3 0 min at room temperature 
and was then stopped with 1 pi of 0,5 M EDTA [Sambrook et al. 
(1989) loc. cit.]. The deletion time points were run on a 0.8 % 

35 low melting agarose gel. The bands were cut out, melted and 

ligated according to Stratagene's instructions. Portions (5 pi) 
of each ligation mixture were used to transform XL-1 Blue 
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cells. 

The clone proved to encode all the short chain peptide 
sequences disclosed in Example 3, except the poorly defined 
5 pentapeptide, 966b. It is notable that the anti-57K serum alone 
was an inadequate tool for cloning this gene: the amino acid 
sequence data disclosed in Example 3 were also essential. 

_Comp^ison of sequences with, th e Microgenie Data _ Bank, shoved 

that the gene sequence of the clone was available as an unknown 

10 reading frame in the post-translational region of the gene for 
yeast (S. cerevisiae) vacuolar H + -ATPase. The data in the bank 
contain sequence errors, and have thus been erroneously 
interpreted as two short unidentified ORFs instead of one long 
ORF- The complete sequence of the TSS1 gene with 800 bp of 

15 promoter and 200 bp of terminator regions is disclosed as SEQ 
ID N0:1 and the amino acid sequence deduced from its ORF 
(starting at nucleotide 796) as SEQ ID NO: 2 . SEQ ID NO : 1 now 
incorporates the following minor corrections to the promoter 
region, made since February 14th 1992: the original nucleotides 

20 60 and 61 (CA) become AC f original nucleotides 646 to 653 

(CGCGTGGT) become GCCGGG and the original nucleotide 711 (C) is 
deleted. Fig 3 A shows the promoter and terminator regions, and 
Fig 3b shows the deduced amino acid sequence . 

25 Example 5 Cloning and sequencing of TSL1 and TSL2 

(a) Preparation and screening of genomic DNA libraries 

The gene TSL1 was first found in the same library as described 
in Example 4. Screening was done using first ahti-TPS/P serum 

30 and then anti-93K serum. Later, another library was constructed 
from a partial EcoRl digest of chromosomal DNA from S. 
cerevisiae . strain S288C, using the methods described in 
Example 4. The anti-93K positive clones were classified by 
restriction mapping into groups, not all of which can represent 

35 TSL1. 
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(to) Sequencing of TSLl 

Clones from one group of anti-93K positive clones from the 
Haelll library were partially sequenced manually and then 
5 automatically from pBluescript exonuclease deletion series as 
described in Example 4. 

The-HaeIII~clones~did-not-contain-the-whole of—this— long-gene, 

and the N-terminus was not found in any clone. Therefore, the 

10 new EcoRl library was constructed and screened, first with 

anti-93 serum and then with nucleotide probes derived from the 
sequenced parts of TSLl. 

Several anti-93K positive clones, which also hybridized with 
15 the nucleotide probes, were obtained. These contained a plasmid 
with an 8.2 kb insert. From this plasmid a 2 kb fragment was 
cut with restriction enzymes StuI and Seal, religated into the 
pBluescript Smal site and sequenced using exonuclease 
deletions. The deletions were started using the enzymes Sad 
20 and Spel. Sequencing was done with the automatic sequencer. The 
sequence of TSLl was thus completed. 

The complete sequence is contained in the 8.2 kb insert of the 
EcoRI clones, and has been deposited as plasmid pALK751 on 
25 February 18, 1992 with the Deutsche Sammlung von 

Microorganismen (DSM) , Gesellschaf t fur Biotechnologische 
Forschung GmbH, Grisebachstr . 8 f 3400 Gottingen, Germany and 
given the accession number DSM 6928. 

30 The sequence is shown as SEQ ID NO: 83. Nucleotides 2282 to 5575 
comprise an ORF that encodes the amino acid sequence SEQ ID NO: 
82. The promoter and terminator regions and amino acid sequence 
are also shown in Fig 4. The amino acid sequence includes the 
amino acid sequences obtained from (fragments of) the long (123 

35 kDa) chain of trehalose synthase disclosed and discussed in 
Example 3. 
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(c) Isolation and sequencing of TSL2 

The information disclosed about the 99 kDa polypeptide 
{especially in Examples 1 & 3) provides obvious procedures for 
the isolation and characterization of the TSL2 gene by one 
ordinarily skilled in the art. Because the anti-93K serum 
recognizes the 99 kDa polypeptide, anti-93K positive clones 
isolated -as described above- can include -clones -representing 
TSL2. Several positive clones not representing TSL1 were 
identified by restriction mapping. One of these was deposited 
on January 28th 1993 as the plasmid pALK756 (see Table 1) with 
the Deutsche Sammlung von Mikroorganismen und Zellkulturen 
GmbH, Mascheroder Weg 1 B, D-3300 Braunschweig, Germany 
(Accession number DSM 7425). This plasmid comprises a 3.5 kb 
insert in pBluescript. The insert was not cut by the 
restriction enzymes, NotI, Sad, Spel or Xhol. The sequences 
of these and similar clones can be examined to identify an ORF 
that encodes the amino acid sequences of peptides isolated from 
the 99 kDa polypeptide (viz., SEQ ID N0:S 29 to 38 and 44 to 
49). Another well established procedure is to use these amino 
acid sequences to design nucleotide primers that can be used to 
amplify parts of the TSL2 gene by the polymerase chain 
reaction. When a part of the TSL2 gene has been isolated and 
sequenced by either procedure, the rest of the gene can be 
easily isolated as described for TSL1. 

Example 6. Characterization of TSS1 and TSL1 . 

The nucleotide sequence of TSSl encodes a polypeptide of 495 
amino acid residues with a calculated molecular mass of 56 kDa. 
This open reading frame starts with an ATG codon and ends with 
two TGA codons. The promoter region contains a TATA box at -186 
(see Fig 3) and the sequence CCCCGC at -270, which has been 
implicated in catabolite repression [Nehlin & Ronne, (1990) 
European Molecular Biology organization Journal 9, 2891-2898]. 
This may account for the low expression of trehalose synthase 
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in the presence of glucose disclosed in Example 2. 

i 

The open reading frame of TSL1 encodes a polypeptide of 1098 
amino acids, corresponding to a calculated molecular mass of 
5 123 kDa. This ORF starts with an ATG codon and ends with a TAA 
codon- Sixty base pairs downstream from the TAA codon is a 
possible TATATA transcription termination element [Russo et al. 
_ (1991) European Molecular Biology Organization- Journal— 10, 563- 
571] * 

10 

The promoter sequence of TSL1 contains two putative TATA boxes 
at -100 and -117. The promoter was searched for possible heat 
shock elements and four AAGGGG elements were found (-166, -180, 
-232 and -378). Of these, the one furthest upstream, at -378, 

15 was part of the sequence GGTAAAAGGGGCGAA , which corresponds 
well to the UAS. 3fi0 heat shock stress control element 
GGTAAGGGGCCAA [Marchler, G. et al f 19921 Yeast 8, S154]. Two 
copies of the canonical heat shock element GAANNTTC were found, 
one at -353 and the other at -425; thus, one on either side of 

20 the UAS. 3fi0 element. 

The sequence GCCCCTGCATTTT at -327 could be a MIG1 protein 
binding site (the consensus sequence is TCCCCRGATTNT) . MIG1 
appears to act as a repressor of transcription in the presence 
25 of glucose [Nehlin, J.O. & Ronne, H. (1990) European Molecular 
Biology Organization Journal 9, 2891-2898; Nehlin, J.O. et al 
(1991) ibid 10, 3373-3378]. These features of the TSL1 sequence 
are shown in Fig 4. 

30 The amino acid sequence encoded by TSL1 contains two 

pblyglutamine tracts, four Qs starting at amino acid 42 and 
five Qs starting at 164. Such glutamine-rich sequences have 
been associated with heteromeric protein-protein interaction 
[Gancedo, J.-K. (1992) European Journal of Biochemistry 206, 

35 297-313] • 



Fig. 5 discloses that the entire TSS1 gene exhibits 37 % 
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identity at the amino acid level to a 502 amino acid stretch 
from the middle of the TSL1 product. The genes are obviously 
closely related. 

Most surprisingly, the TSS1 gene is identical to the CIF1 gene 
that has been recently cloned and sequenced by Gancedo's group 
[Gonzales et al (1992) Yeast 8 183-192]. This disclosure 

reveals that special. methodology . is_required to handle -mutants 

containing modified forms of the TSS1 gene, because cifl 
mutants have severe defects in sugar metabolism, as discussed 
in the Detailed Description. It also explains, of course, why 
no recognisable short chain is present in the Klg 102 mutants, 
which carry the cifl mutation (see Example 7). Previously, it 
has been (tacitly) assumed that failure of cifl and fdpl 
mutant.s to express TPS activity is the consequence of a lengthy 
cascade of regulatory effects. The findings disclosed here and 
in Example 7 show that absence of the short chain of trehalose 
synthase is the primary defect, from which, in an as yet 
completely obscure way, the other regulatory defects of these 
mutants result. 

S.cerevisiae chromosomes were separated by pulsed field 
electrophoresis, with pulse times of 60 sec for 15 h and 90 sec 
for 9 h at 200 volts, as recommended by the instruction manual 
for the CHEF-DR II BioRad Laboratories, Richmond, California] . 
Genes were located using digoxigenin-labelled non-radioactive 
probes, following the instructions in the manual by Boehringer 
Mannheim. The following probes were used: a 2.1 kb Dral 
restriction fragment from TSL1 and a 1.9 kb Narl-Smal 
restriction fragment of TSS1 (the Smal site is in the linker 
between the insert and the vector; important restriction sites 
in TSSl and TSL1 are shown in Fig 6) . TSSl was located 
exclusively on Chromosome 2, which is where both FDP1 [Van de 
Poll and Schambert (1977) loc. cit.] and CIF1 [Gonzales et al. 
(1992) loc. cit.] have been located. This disclosure further 
strengthens the evidence for the identity of TSSl with CIF1 and 
FDP1. By using the GalH gene as a marker for chromosome 16 TSL1 
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was located exclusively on the adjacent Chromosome 13. 
Immediately downstream of TSL1 lies, in opposite orientation, 
the ARGRII gene, sequenced by Messenguy et. al* [(1986) 
European Journal of Biochemistry 157, 77-81] • The start of the 
5 overlapping sequence is shown in Fig 4. 



Example 7 A functional TSS1 gene is required for expression of 
both TPS and TFP activities 

10 

The S. cerevisiae mutant Klg 102, was obtained from Dan 
Fraenkel (Harvard Medical School) and has the genotype MATa, 
ural, leul, trp5, cif 1-102. It was routinely grown on YP/2% 
galactose or YP/2% glucose, and long term storage was under 
15 liquid nitrogen. As reported by others [Navon, G. , et al. 

(1979) Biochemistry 18, 4487-4499; Bariuelos, M. & Fraenkel, 
D*G* (1982) Molecular and Cellular Biology 2, 921-929], this 
mutant would not grow on YP/2% fructose, though revertants were 
frequent. 

20 

Six individual colonies from each of two substrains of Klg 102, 
ALKO 2669 and ALKO 2670, that differed in reversion frequency 
and colony size, were streaked onto YP/2% fructose and YP/2% 
glucose at 30 °C. After 45 h, all 12 streaks were growing on 

25 glucose, although slower than the control yeast, X2180, but 

none showed any growth on fructose. After 4 days, five of the 
ALKO 2669 streaks showed several large, but isolated colonies 
on fructose and one ALKO 2670 streak showed several small 
colonies on fructose. From the glucose plates, three streaks 

30 from each substrain were chosen for the smallest number of 
revertants on the corresponding fructose plate, and used to 
inoculate 100 ml portions of YPD in 250 ml shake flasks, and 
grown at 200 r.p.m. and 30 °c. Three parallel flasks were 
inoculated with X2180. A600 and residual glucose in the media 

35 were monitored and samples were plated out quantitatively onto 
YP/2% glucose and YP/2% fructose. The ALKO 2669 cultures grew 
faster than the ALKO 2670 cultures, and both grew much slower 
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then X2180 (not shown) . * 

At appropriate times the cells were harvested, broken and ' 
analyzed as described in the General Materials and Methods . 
5 Results are shown in Table 5, 
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Table 5. Growth of Klg 102 and X2180 strains on YPD 



10 



The cultures were performed as described in the text. Residual 
glucose and cell mass are given as f respectively, g/100 ml and 
mg/ml of growth medium. Phosphoglucoisomerase (PGI) was 
determined as described in Example 11. PGI, TPS and TPP are 
given as U/g of wet cells (TPS was determined in the presence 

_of_5--mM-F-6P)_._ Trehaldse-is_given-as~mg/-g-of-wet-cells 

Viability Fru/Glu shows the number of cells able to 
grow on fructose as a percentage of the number of cells able to 
grow on glucose at the time of harvesting. Cells from the 
cultures 2670/1 and 2670/2 were combined for breakage and 
subsequent analysis. ND, not determined. 



15 



20 



25 



30 



35 



40 



Strain 


Age 
(h) 


Residual 
Glucose 
(g %) 


Cell Mass 
(mg/ml) 


PGI 
(U/G) 


TPS 
(U/G) 


TPP 
(tJ/G) 


Trehalose 
(mg/ g) 


Viability 
Fru/Glu 
(%) 


Klo 102 


cultures 














2569/1 


24 


ND 


4.3 


88 


SO. 02 


ND 


ND 


2.4 * 


2669/2 


48 


£0.02 


11.6 


81 


£0. 03 


0.034 


ND 


Sl. 7 


2669/3 


114 


none 


10.3 


ND 


ND 


SO. 02 


SO. 22 


Si. 8 


2670/1 
2670/2 


110 
110 


none 


9 .7 


89 


SO. 03 


0.081 


ND 


1.4 
4.0 


2670/4 


114 


none 


11.2 


ND 


ND 


sO.02 


SO. 19 


SO. 3 


X2180 cultures 














1 


24 


ND 


19.1 


93 


6.3 


1.7 


ND 


ND 


2 


110 


none 


31.7 


126 


6.3 


2.3 


ND 


ND 


3 


114 


none 


34.4 


ND 


ND 


2.9 


29.3 


ND 



These results show that TPS activity was below the detection 
level in the Klg 102 samples and less than 0.5 % of the value 
45 in X2180, which is typical of wild type S. cerevisiae . This 

agr es with previously reported results [Paschoalin, V.M.F., et 
al. (1989) Current Genetics 16 , 81-87]. Surprisingly, however r 
TPP activities were also very low, between < 1 % and 5 % of the 
X2180 values. Even this residual ability to hydrolyse 
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trehalose-6-phosphate is likely to be due to non-specific 
phosphatases. Paschoalin et_al. [(1989) loc. cit. ] claim that 
Klg 102 specifically lacks UDPG-linked TPS activity, but that, 
like the wild-type yeast S288C (which is the haploid form of 
5 X2180) , it contains an ADPG-linked activity. If this were true, 
and accepting the conventional view that trehalose synthesis in 
yeast proceeds via free trehalose-6-phosphate, Klg 102 should 

contain_signif icant__TPP^activity-.-Our-results disclose- that 

this is not the case. Furthermore, when we tested whether wild 
10 type yeast (X2180) was able to synthesise [ 14 C] -trehalose from 
[ U C]-G6P in the presence of UDPG or ADPG, we found significant 
activity only in the presence of UDPG. The assay systems used 
by Paschoalin et al. [(1989) loc. cit.] have been criticised by 
Vandercammen et al. [(1989) loc. cit.], /so we tested the 
overall reaction directly. Yeast extracts were incubated in 4 0 
mM HEPES pH 6.8 containing 1 mg BSA/ml, 10 mM MgCl 2 and 10 mM 
[U 1A C]-G6P (736 c.p.m./nmol) in the presence or absence of 5 
mM UDPG or 2 . 5 mM ADPG and presence or absence of 5 mM K 
phosphate. Reactions were stopped by boiling for 2 min and 
addition of AG1-X8 (formate) anion exchange resin, as in the 
TPP assay system described by Londesborough & Vuorio [(1991) 
loc. cit.]. Results are shown in Fig 7. Without UDPG or ADPG, 
radioactivity appeared in the resin supernatants , presumably 
due to phosphatases active on G6P. UDPG caused a clear increase 
in this rate in the absence of phosphate and a marked increase 
in the presence of 5 mM phosphate, which stimulates the TPP 
activity and inhibits the TPS activity of trehalose synthase. 
With UDPG and 5 mM phosphate, the increase in rate 
corresponded, after a lag phase, to 0.94 jMiol/min/g of fresh 
yeast, which is about 50 % of the TPP activity of this yeast at 
20 mM phosphate. ADPG, however, did not cause any significant 
increase in the rate of appearance of radioactivity in the 
resin supernatant, indicating that no ADPG-linked TPS activity 
was present. 

Western blots of the homogenates of Klg 102 and X2180 yeast are 
shown in Fig. 8. The origin of the bands marked D is not clear: 
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they may be degraded short chain. X2180 shows a strong 57. kDa 
band, due to the short chain of trehalose synthase ^nd several 
weak bands at 100 to 130 JcDa due to intact and truncated 
versions of the long chain. In contrast, although the Klg 102 
samples showed stronger long chain bands, because more yeast 
sample was applied to the gel, they showed no trace of a short 
chain band. Thus f Klg 102 does not contain a recognisable form 

.of _ the_product_of _ the__TSS l _gene_(it_ might _contain-a- truncated 

version lacking the epitopes recognised by our polyclonal 
antibodies) , but contains normal amounts of the TSL1 product. 
Furthermore, the TSL1 product appears to increase as Klg 102 
traverses the diauxic lag (compare e.g. lanes 3 and 2 of Fig. 
8), suggesting that expression of the long chain of trehalose 
synthase in this yeast increases when all glucose is consumed. 
In wild type yeast f increases in both short and long chains 
occur concomitant with the increases in TPS and TPP activities 
when glucose is consumed (Example 2) . 

These results disclose that the failure of Klg 102 to express 
immunologically recognisable short chain of trehalose synthase 
is correlated with the absence of both TPS and TPP activities. 
This unexpected behaviour, in contradiction of the views of 
Paschoalin et al. [(1989) loc. cit.], indicates that a 
functional short chain is required to assemble a trehalose 
synthase with either partial activity. 

Similar experiments were done with S. cerevisiae , strain MVS 807 
(obtained from Johan Thevelein, Laboratorium voor Moleculaire 
/ Celbiologie, Instituut voor Plantkunde, Heverlee, Belgium) , 
which carries the fdpl mutation, which is allelic to CIF1 and 
TSSl. This strain grew poorly on glucose (fructose was not 
tested) and so was grown on galactose. Stationary phase cells 
contained 6 ± 6 % of normal TPS but about 20 % of normal TPP. 
Western analyses showed the presence of a band at 57 kDa 
recognised by anti-57K serum as well as normal long chain 
bands, so the mutation in MV6807 must be an aminoacid 
substitution. Apparently, this substitution causes a greater 



/ 



WO 93/17093 



PCfiFi<"3/ftO?349 



53 

decrease in TPS activity than TPP activity. 

Example 8. Biochemical evi dence that a long chain of trehalose 
synthase is required for TPP activity 

5 

Truncated trehalose synthase containing the short (57 kDa) 
chain and the 86 and 93 kDa long chain fragments was prepared 

accor ding to the method o f Lond esb oroug h & Vuorio _(199l)^ loc. 

cit.] for proteolytically activated TPS/P complex. TPS and TPP 

0 activities were assayed as described by Londesborough & Vuorio 
[(1991) loc. cit.]. [N-ethyl-l- u C]- maleimide (ethyl-labelled 
NEM; 40 mCi/mmol) was NEC-454 from New England Nuclear. 
N-ethyl- [2, 3- 1A C] -maleimide (ring-labelled NEM; 6 mCi/mmol) was 
CFA 293 from Amersham International. Both were obtained as 

5 solutions in n-pentane and the manufacturer's stated specific 
activities were assumed to be correct. Unlabelled N-ethyl- 
maleimide (NEM) was E-3876 from Sigma. It was -dissolved in 25 
mM HEPES pH 7.0 immediately before use and standardized by 
absorption measurements at 305 run, assuming an E"* of 0.62. 

) 

Treatment of truncated trehalose synthase with 1.9 mM NEM at 24 
°C in the presence of about 0.17 mM dithiothreitol (which 
presumably rapidly consumes about 0.34 mM NEM) caused a rapid 
and essentially complete (> 98 %) loss of TPP activity, but 

1 little (< 24 %) loss of TPS activity (Fig. 9) . This suggested 
that NEM modified one or more amino acid (presumably cysteine) 
side chains that are required intact for TPP but not for TPS. 

To permit quantitative experiments with low concentrations of 
labelled NEM, the dithiothreitol in the enzyme preparation was 
removed by gel-filtration through Pharmacia NAP5 columns 
equilibrated with 1 mg BSA/ml of 25 mM HEPES pH 7.0 containing 
2 mM MgCl 2 , 1 mM EDTA and 0.2 M NaCl. Recoveries of TPS and TPP 
activities through this gel-filtration were above 85 %. 

In one experiment, 2.0 fil of 2.4 mM ethyl-labelled NEM was 
mixed with 150 /xl of gel-filtered enzyme and incubated at 23 
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°C. Samples (10 jxl) taken at various times up to 190 min were 
mixed with 60 /il of Laemmli sample buffer (the mercapto-ethanol 
in this buffer should destroy residual NEM) , boiled for 5 min 
and subjected to SDS-PAGE. At closely similar times (and also 
5 at 23 h) other samples (10 jLtl) were mixed with 100 /zl (for TPS) 
or 700 (il (for TPP) of 5 mg BSA/ml 25 mM HEPES pH 7.0 
containing 2 mM MgCl 2 , 1 mM EDTA, 0.2 M NaCl and 1 mM 
dithi othr eito 1 ( the dithiothreitol should destroy^ residual NEM) 
and assayed for TPS and TPP. The enzyme dilution used for the 
10 TPP assay was sufficient that radioactivity from the NEM (about 
1/3 of which remains in the resin supernatant) did not 
interfere with the TPP determinations. 

After electrophoresis, the upper (cathode) buffer, containing 

15 most of the added radioactivity, was completely removed before 
disassembling the apparatus. The gel was then fixed, stained 
and destained as described by Laemmli [(1970) Nature, Londofi 
227, 680-685] and dried. An autoradiogram of this gel (Fig. 10) 
showed that the 93 kDa band (and also BSA) became labelled 

20 during the experiment, while the 86 and 57 kDa bands were much 
more weakly labelled. The Coomassie blue stained bands and 
adjacent, empty areas (as blanks) were cut out of the dried gel 
(in later experiments, they were cut from undried gels) , broken 
up and extracted overnight with 1 ml of 5 % SDS in pre-blanked 

25 scintillation vials. Then 10 ml of a toluene/ Triton XlOO-based 
scintillant was added, and the tubes were repeatedly counted 
using a wide energy window to minimise quench effects. After 10 
h constant counting levels were reached. Excess radioactivity 
was calculated by subtracting a blank value obtained from empty 

30 regions of the gel. Results are shown in Fig. 11. In control 

experiments, in which enzyme was omitted, it was shown that the 
excess radioactivity found in the 93 and 86 kDa bands did not 
originate from potential labelling of impurities in -the BSA. 

35 Fig. 11 shows that label from NEM enters mainly the 93 kDa 
fragment of the long chain, with relatively small amounts 
entering the 86 kDa fragment and the 57 kDa short chain. Also, 
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10 



15 



the amount of label entering the long chain fragments (93 + 86 
kDa) is roughly proportional to the loss of TPP activity, but 
lags increasingly behind this loss: at 10.5 min 30 % of the 
initial TPP was lost and 0.20 moles of NEM had entered the long 
chain fragments per mole (150 Kg) of enzyme, whereas at 190 
min, 56 % of TPP was lost and 0.32 moles of NEM had entered the 
long chain fragments. Possibly, since trehalose synthase may be 
an octamer (its native molecular mass is about 8Q0 kDa), _ 
reaction of one long chain with NEM can eventually lead to loss 
of activity associated with the other long chains in the 
octamer. Fig. 12 collates data from several experiments, using 
both ring- and ethyl-labelled NEM. Parallel experiments with' 
identical concentrations of ring- and ethyl-labelled NEM 
suggested that about 25 % of the radioactivity from ethyl- 
labelled NEM originally fixed in the protein was lost during 
SDS-PAGE processing (some loss is expected in acidic 
condition) , and the results with ethyl-labelled NEM have been 
corrected accordingly. Within the limits of accuracy (a 
specific activity of 30 TPS units/mg was used to calculate the 
mass of protein and a dimer molecular mass of 150 kDa was 
assumed for the truncated enzyme) complete loss of TPP 
reflected incorporation of rather less than l mole of NEM into, 
specifically, the long chain fragments. 

55 Another reagent with high specificity for cysteine, 

dithiodinitro-benzoate (DTNB), also caused a specific loss of 
TPP activity: after 10 min treatment with 0,9 mM DTNB over 95 % 
of the TPP was lost and less than 28 % of the TPS. 

0 These findings disclose that TPP activity requires a long chain 
with a proper structure, because modification of a single amino 
acid (presumable cysteine) residue in the 93 kDa fragment 
eliminates TPP but not TPS activity. Sequencing data given in 
Example 3 disclosed that the 93 kDa band contained material 
from both the 99 kDa and 123 kDa long chains. Thus, the present 
results disclose that either the 99 kDa or the 123 kDa or both 
long chains are involved in TPP activity. 
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Example 9. An isolated 99 kDa polypeptide from trehalose 
synthase contains TPP activity 

Because the long and short chains of trehalose synthase were 
5 dificult to separate by usual chromatographic procedures, 

fractionations were attempted in the presence of a non-ionic 
detergent. During fractionation with a NaCl gradient on 
DEAE-cellulose (Whatman DE52) in 1 % Triton X100 at pH 8.0, the 
enzyme was recovered in about 90 % yield at 140 mM NaCl. Some 

10 minor polypeptides (e.g. the weak 68 kDa polypeptides visible 
in Fig 1) were removed, but the main 57, 99 and 123 kDa 
polypeptides were not resolved. However, the ratio of the 99 
and 123 kDa bands changed from about 1.5 to 0.3 across the 
enzyme peak, while concomitantly the TPP/TPS ratio decreased 

15 steadily from 0.54 to 0.42 (data not shown). This suggested 

that the procedure was partially resolving trehalose synthase 
molecules, enriched in the 99 kDa polypeptide from those 
enriched in the 123 kDa polypeptide and that the former had a 
relatively higher TPP activity. By extrapolation it can be. 

20 calculated that the TPP/TPS ratio of (hypothetical) enzyme 
containing only 57 and 99 kDa chains would be 0.65 ± 0.1O, 
whereas that of enzyme with only 57 and 123 kDa chains would be 
0.32 ± 0.10. 

25 Because the long chain appears to contain an avid phosphate 
binding site (see Examples 10 and 12) , chromatography on 
phosphocellulose was attempted. Native trehalose synthase (4.2 
TPS units) was transferred above a PM10 membrane in an Amicon 
cell to 25 mM HEPES pH 7.0 containing 2 mM MgCl2 , 1 mM EDTA, l 
30 mM dithiothreitol and 0.3 % Triton X100 (HMED/0.3 %T) and 

applied to a 0.7 x4. 2 cm column of phosphocellulose (Whatman 
Pll-cellulose) equilibrated with the same buffer. The column 
was washed with 4 ml of HMED/0.3 %T and. developed with a linear 
gradient from zero to 0.6 M NaCl in 60 ml of HMED/0.3 %T at 5 
ml/h. By 0.35 M NaCl only traces of TPS had been elut d (< 3 % 
in the first 9 ml and < 9 % spread between 0.15 and 0.35 VL 
NaCl) . The gradient was interrupted and the column was washed 
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sequentially with (a) 8 ml of 10 mM fructose-6-phosphate in 
HMED/0.3 %T/0.35 M NaCl, (b) 6 ml of HMED/0.3 %T/0.6 M NaCl and 
(c) 0.2' M K phosphate pH 7.0/2 mM MgCl 2 /l mM EDTA/1 mM 
dithiothreitol. No TPS or TPP activity was recovered except in 
a single 1.5 ml fraction in which the 0.6 M NaCl began to 
elute. This contained 12 % of the applied TPP, but <0.1 % of 
the applied TPS. 



Fractions were examined by SDS-PAGE (Pig. 13) , which showed: 

(1) almost pure short chain eluted at and just before the start 
of the NaCl gradient in fractions devoid of enzyme activity; 

(2) traces of short and long chain eluted diffusely at about 
0.2 to 0.35 M NaCl in fractions containing altogether < 7 % of 
the applied TPS activity; (3) at least 50 % and possibly all of 
the applied 99 kDa polypeptide eluted at 0.6 M NaCl in the 
fraction containing 12 % of the applied TPP activity; and (4) 
most of the 123 kDa polypeptide remained bound to the column. 

Intact trehalose synthase has also been fractionated on 
phosphocellulose in the absence of Triton, and with elution by 
a simple linear gradient from 0 to 0.6 M NaCl. Pure or nearly 
pure 99 kDa polypeptide eluted at about 0.45 M NaCl and 
contained specific TPP activity ( 14 c-G6P was not hydrolyzed) . 
This activity differed from the TPP activity of intact 
trehalose synthase in that the ratio of activities at 25 mM 
phosphate and 50 mM Hepes was between 1.5 and 3 in different 
experiments (cf, this ratio is 5 to 6 for intact trehalose 
synthase) . Furthermore, during storage of the isolated 99 kDa 
polypeptide at 0 °C, the TPP activity at 25 mM phosphate 
decreased and that at 50 mM Hepes increased, until the ratio 
was about 0.7 after 7 weeks. 

These findings disclose that the 99 kDa polypeptide isolated 
from intact trehalose synthase is a specific trehalose-6-: 
phosphatase, but that its catalytic properties are unstable 
and differ from the TPP activity of intact trehalose synthase. 
Together with the disclosure in Example 7 that yeast requires a 
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properly functional TSS1 gene to exhibit TPP activity, the 
results suggest that proper folding of the 99 kDa polypeptide 
requires the presence of the 57 kDa chain. 

5 These findings also disclose that when the short chain is 

separated from the long chain by chromatography in a buffer 
containing 0.3 % Triton, in which intact trehalose synthase is 
stable, it rapidly looses any _TPP or TPS activity it possessed 
when correctly folded in the trehalose synthase. 

10 

The findings also indicate that the full-length long chain has 
extraordinarily high affinity for phosphocellulose, which is 
consistent with the location of a high affinity phosphate 
binding site in a terminal portion of this chain as suggested 
15 by Examples 10 and 12. 

Example 10 Truncation of the 123 kDa long chain of trehalose 
synthase bv trypsin in vitro dramatically increases TPS 
activity 

20 

Removal, of the N-terminal 325 or so amino acids from the 123 
kDa long chain of intact trehalose synthase by treatment with 
trypsin in vitro produces an enzyme with catalytic properties 
like those of the truncated enzyme purified by Londesborough & 

25 Vuorio [(1991) loc. cit. ]. In one experiment intact trehalose 
synthase (0.28 TPS units, s; 9.4 fiq) was incubated with or 
without 0.5 /zg of trypsin at 30 °C in 250 pi of 13 mM HEPES pH 
7.0 containing 1 mM MgCl 2 , 0.5 mM EDTA, 0.5 mM dithiothreitol f 
0.2 M NaCl and 0.5 mM benzamidine. Its TPS activity was 

30 determined at intervals using standard assay mixtures 

(containing 5 mM F6P) containing no or 4 mM K phosphate pH 6.8, 
and samples were prepared for SDS-PAGE analysis immediately 
before and 48 min after addition of the trypsin. 

35 During the first 48 min the TPS activity measured in the 
absence of phosphate decreased faster in the presence of 
trypsin than in its absence. However, in the first 10 min, 
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trypsin caused a 4 -fold increase in the activity measured at 4 
mM phosphate, and by 48 min the activities with and without 
phosphate were essentially equal (Fig. 14). By 48 min, the 123 
kDa full length long chain had disappeared and been replaced by 
5 a doublet of polypeptides at 85 kDa (Fig. 15) . In contrast, the 
short chain (57 kDa) was unchanged and the 99 kDa band was only 
slightly decreased in strength. The changes in TPS activity 
were accompanied by loss of about 5 0_ % of the TPP activity 

10 Which part of the 123 kDa chain was removed by trypsin was 

determined as follows. Intact trehalose synthase (180 /Ltg) was 
transferred to 0.5 ml of 25 mM HEPES pH 7.0 containing 2 mM 
Mgci 2 , 1 mM EDTA, 1 mM dithiothreitol and 0.2 M NaCl using a 
Centricon 30 tube, and then treated with 11 /zg trypsin at 25°C. 

15 The standard TPS activity did not decrease during the trypsin 
treatment, whereas TPS activity measured in the absence of F6P 
and presence of 10 mM phosphate increased from 26 % to 73 %* of 
the standard activity during the first 30 min of treatment. 
After 68 min treatment, when SDS-PAGE analysis showed the 

20 complete disappearance of the 123 and 99 kDa bands and 

appearance of a doublet with apparent molecular mass about 85 
kDa (the components differing by about 1.5 kDa), the mixture 
was centrifuged through a Centricon 30 tube to separate the 
tryptic peptides from the core enzyme. The retentate was then 

25 boiled in 0.5 % SDS and again centrifuged through the Centricon 
3 0 tube. The combined filtrates were diluted to 0.1 % SDS and 
incubated for 18 h at 25 °C with 4 % by weight of 
endoproteinase Glu-C (Boehringer) . The peptides were then 
separated by HPLC using a DEAE pre-column and sequenced as 

30 described in Example 3. 

Twenty sequences were obtained (Seq ID NOs 58 to 77 in Table 
6). Fifteen of these were found in the N-terminal 3 25 amino 
acids coded by TSL1. One (peptide 1407, recovered at less than 
half the yield of the others) was amino acids 1089 - 1093, 
i.e., 5 amino acids from the C-terminus of the protein coded by 
TSLl. This peptid is presumably derived by endoproteinase Glu- 
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C cleavage of the tryptic peptide starting after Lys 1079- Both 
the 86 and 93 kDa long chain fragments in the truncated 
trehalose synthase purified by Londesborough & Vuorio [(1991) 
loc. cit. ] are disclosed in Example 3 to contain a peptide 
5 (1483b & 1484b) derived from Alal064 to Lysl079, confirming 

that the truncated polypeptides extend at least this close to 
the C-terminus of the full length 123 kDa chain. The N- terminal 
peptide furthest from the N-terminus was peptide 1443 , J obtained 
by cleavage after Arg 335, Thus, the truncated long chain 

10 extends from Ser 336 to Lys 1079 or Asp 1098 f and is predicted 
to have a molecular mass of 87.3 or 86.2 kDa. The SDS-PAGE 
analysis of trypsin-treated enzyme suggests both of these 
truncated chains are formed, and because the TPS activity in 
the presence of F6P changes little during the trypsin 

15 treatment, the two truncated chains probably have similar 
activities . 

Of the remaining four peptides in Table 5, two (1419b and 
1437b) are still unidentified, but may originate from the 99 
20 kDa polypeptide, whereas two (1442 and 1451) clearly originate 
from that polypeptide. Thus, peptide 1442 is identical to 
peptide 1307a of Table 4, and the first 5 amino acids of 
peptide 1451 are identical to peptide 1297a (Table 4). 

25 These results disclose that removal of the N-terminal 325 amino 
acids of the long chain, with or without removal of the O 
terminal 19 amino acids, results in a trehalose synthase that 
is relatively insensitive to inhibition by phosphate, and does 
not require F6P for full activity. Analysis of the secondary 

30 structure of the long chain according to Gamier et al [(1978) 
Journal of Molecular Biology, 120, 97-120] suggests that 
whereas the O terminal 700 amino acids are likely to be in 
alpha-helices or beta-sheets, the N-terminal 360 amino acid 
portion of the protein is relatively, devoid of such structures. 

35 Taken together, these data suggest that the N- terminal 330 or 
so amino acids comprise a distinct domain, that confers 
regulatory properties upon the TPS activity of trehalose 
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synthase, including sensitivity to inhibition by phosphate and 
a requirement for F6P to express full catalytic activity. Thus, 
the TSL1 gene product must also be involved in TPS activity. 

5 Table 6 Peptides release d from intact trehalose synthase during 
activation by limited treatment with trypsin. 

W hen two seq uences wer e obtained from the same HPLC peak, they 

are shown as a and b sequences, assigned according to the 

10 sequences predicted from the TSL1 gene. Tentative 

identifications from the amino acid sequencer are shown by one 
letter codes and double queries; unidentified residues Xaa. (In 
the Sequence Listings also tentative identifications are 
indicated as Xaa) . The location of each amino acid sequence in 

15 the long (123 kDa) chain of trehalose synthase in Fig 4b is 
shown below the sequence. 



14 00 Leu-Leu-Val-His-Ser-Leu-Leu-Asn-Asn-Thr-Ser-Gln-Thr-Ser- 
2 0 Leu-Glu-Gly-Pro-Asn 

(SEQ ID NO: 58) (181-200) 

1401 Ser-Ser-Thr-Thr-Asn-Thr-Ala-Thr-Leu-Xaa-Xaa-Leu-Val-Ser- 
Ser-Xaa-Ile-Phe-Met-Glu 
25 (SEQ ID NO:59) (84-104) 

1406 Ala-G??-Asn-Arg-Pro-Thr-Ser-Ala-Ala-Thr-Ser-Leu-Val-Asn- 
Arg 

(SEQ ID NO: 60) (210-24) 

0 

1407 Xaa-Phe-Thr-Ile-Ile-S?? 
(SEQ ID N0:61) (1088-93) 

1408 Asn-Leu-Thr-Ala-Asn-Ala-Thr-Thr-Ser-His-Thr-Pro-Thr-Ser- 
5 Lys 

(SEQ ID N0:62) (105-19) 
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1409 Phe-G??-G??-Tyr-Ser-Asn-Lys 
(SEQ ID NO: 63) (319-25) 

1416 S??-Pro-S??-Ala-Phe-Asn-R?? 
(SEQ ID NO: 64) (77-83) 

14 17a Ile-Ala-Ser-Pro-I le-Gln-T? ? -Glu 

__(SEQ_ID NO_:65.) (145-52.) - 

1417b Gln-Arg-Pro-Leu-Leu-Ala-Lys 
(SEQ ID NO: 66) (257-63) 

1418 Phe-Phe-Ser-Pro-Ser-Ser-Asn-Ile-Pro-Thr-Asp-Arg 
(SEQ ID N0:67) (133-44) 

1419a Ala-Leu-Ser-Asn-Asn-Ile-Ser-Gln-Glu 
(SEQ ID NO: 68) (47-55) 

1419b A??-L??-s??-Tyr-Thr-Pro 

(SEQ ID NO: 69) (not found) 

142 0 Ile-Ala-Ser-Pro-Ile-Gln-Gln-Gln-Gln-Gln-Asp-Pro-Thr-Ala- 

Asn-Leu 

(SEQ ID NO: 70) (159-74) 

1437a Thr-Met-Leu-Lys-Pro-Arg 
(SEQ ID NO: 71) (120-25) 

1437b Ile-Ile-Glu-Asp-Glu-Ala 

(SEQ ID NO: 72) ((not found) 

1438 Ile-Thr-Pro-His-Leu-Thr-Ala-Ser-Ala-Ala-Lys 
(SEQ ID NO: 73) (246-56) 

143 9 Ser- Leu-Val-Ala-Pro-Ala-Pro-Glu 

(SEQ ID NO: 74) (56-63) 
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1442 Lys-Pro-Gln-Asp-Leu-Asp-Asp-Asp-Pro-Leu-Tyr-Leu 
(SEQ ID NO:75) (from 99 kDa) 

1443 Lys-Tyr-Ala-Leu-Leu-Arg 
(SEQ ID N0:76) (330-35) 



1451 Gln-Leu-Gly-Asn-Tyr-G? ? -Phe-Tyr-Pr o-Val -Tyr 

(SEQ ID MO: 77) ((from 99 kDa) ^ 

10 

Example 11 Identification of the TPS activator as 
phosphocrlucoisomerase 

TPS activator was transferred to 0.1 M Tris/HCl pH 9.0 above a 
15 PM10 membrane in an Amicon cell. A 300 ill sample (34 jug) was 
digested for 20 h at 37 °C by 0.8 /xg of lysylendo-peptidase C 
(Wako) . Peptides were separated by HPLC and sequenced as 
described in Example 3. All five sequences obtained and 
disclosed in Table 7 are identical to sequences found in yeast 
20 phosphoglucoisomerase (PGI) . 
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The PGI sequences are from Tekamp-Olson, P. , et ai. (1988) Gene 
73, 153-161. 

TPS-Activator Peptide PGI Residues 

TA1156 TFTNYDGSK 51 - 59 

(SEQ ID MO: 39) 

TA1158 TGNDPSHIAK 241 - 251 

(SEQ ID NO: 40) 



TA1159 IYESQGK 24-30 
15 (SEQ ID MO: 41) 

TA1160 AEGATGGLVPHK 456 - 467 

(SEQ ID NO: 42) 

20 TA1161 LATELPAXSK 11 - 19 

(SEQ ID MO: 43) 



25 The PGI activity of a sample of TPS activator that had been 
stored for several months at 0 °C was measured in 50 mM 
HEPES/KOH pH 7.0, 5 mM MgC^, 5 mM F6P and 0.4 mg/ml NADP. A 
specific activity of 190 U/mg was found. 

30 These findings disclose that TPS activator from S. cerevisiae 
is identical to PGI. Example. 12 discloses that F6P is a 
powerful activator of the TPS activity of intact, but not of 
truncated, trehalose synthase. Because the assay mixtures for 
TPS contain G6P, it is clear that TPS activator can activate 

35 - TPS by producing F6P from the substrate G6P. This is a complete 
explanation for the activation. Thus, at initial concentrations 
of 6.7 mM G6P and 1.9 111M F6P (i.e. G6P/F6P = 3.5, the 
experimental equilibrium ratio) the rate was independent of TPS 
activator and equal to that at 9 mM G6P with TPS activator. 

40 * Previous investigations [Londesborough & Vuorio (1991) loc. 
cit.] had to use crude preparations of intact trehalose 
synthase because pure intact trehalose synthase was not 
available. Although the effectiveness of TPS activator 
preparations was reported to vary between different enzyme 
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preparations, under certain circumstances data were obtained 
that suggested TPS activator might interact stoichiometrically 
with native trehalose synthase [ Londesborough & Vuorio (1991) 
loc. cit. ]• The present findings show that this suggestion was 
completely incorrect. The findings also imply that kinetic data 
in the literature are confused, because some preparations of 
so-called "trehalose-6-phosphate synthase" will have contained 
PGI whereas some may not. With the former preparations, the 
activator F6P will have been generated from the substrate G6P, 
but the amount so generated will have depended upon the details 
of the experimental procedure used. 

Example 12 . The different kinetic behaviours of intact and 
truncate d trehalose synthase 

Truncated trehalose synthase was prepared as described by 
Londesborough & Vuorio [(1991) loc. cit.] and contained the 57 
kDa short chain and 86 and 93 kDa fragments of the long chain. 
Intact trehalose, synthase was prepared as in Example 1 . Kinetic 
assays were done at 3 0 °C as described in General Methods and 
Materials . 
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fa) The TPS Partial Activity 

> \ 

Table 8. Inhibition of the TPS activities of intact and 
truncated enzyme bv phosphate at 5 mM F6P 

5 

The effect of adding K phosphate pH 6.8 to standard assay 
mixtures (10 mM G6P, 5 mM UDPG and 5 mM F6P) is shown. For each 
enzyme, the activity without phosphate is s et at 10 0_ _%. 

10 Added Phosphate Intact Enzyme Truncated Enzyme 

None t 100 % 100 % 

1.3 mM 69 % 94 % 

4.0 mM 14 % 83 % 



15 



The TPS activity of intact enzyme was much more sensitive to 
inhibition by phosphate than was that of the truncated enzyme 

20 (Table 8). The results in Table 8 underestimate the difference 
between the phosphate responses of intact and truncated enzyme, 
because F6P partially reverses the phosphate inhibition of 
intact enzyme (see below) but has virtually no effect on 
truncated enzyme. Table 9 shows the effect of shifting from the 

25 salt conditions of the standard assay (40 mM HEPES/KOH pH 6.8, 
10 mM MgCl 2 ) to conditions closer to those of yeast cytosol. In 
the absence of F6P, the shift caused 67 % inhibition of intact 
enyzme (from 43 % to 14 % of the standard activity) but only 10 
% inhibition of truncated enzyme (from 96 % to 86 %) . 

30 

Table 9. Effect on the TPS activity of intact and truncated 
enzyme of shifting to more physiological salt conditions 

For measurements at "physiological conditions", 1.3 mM K 
35 phosphate and 0.1 M KC1 were added to the standard assay 
mixtures and the MgCl 2 was decreased from 10 to 2.5 mM. 
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Standard Cond. Physiological Cond, 

f5 xnM F6P) No F6P 5 mM F6P No F6P 

Intact 100 % 43 % 72 % 14 % 

Truncated 100 % 96 % 90 % 86 % 



These results disclose the insensitivity of the TPS activity of 
10 truncated trehalose synthase to physiological phosphate 
TOricehtralTibh^ of F6P at a 

concentration well above the normal value in yeast cytosol 
(between 0.1 and 1 mM; Lagimas, R. & Gancedo, C. (1983) 
European Journal of Biochemistry 137, 479-483). 

15 

Fig. 16 illustrates the F6P-dependence of the TPS activity of 
intact enzyme at different phosphate concentrations. Double- 
reciprocal plots of these data (not shown) indicate that at 1.3 
mM phosphate, and perhaps at 4 mM phosphate, sufficiently high 

20 concentrations of F6P completely overcome the inhibition by 
phosphate. With no added phosphate, F6P caused a maximum 
activation of 2.5-fold, with a K, A of 60 ^M. At 1.3 mM 
phosphate, the maximum activation was at least 20-fold, and the 
K, A was 1.4 mM F6P. The slopes of these double-reciprocal plots 

>5 varied linearly with the square of the phosphate concentration, 
suggesting that two phosphate binding sites are involved. At 4 
mM phosphate, which is still within the probable range of 
phosphate concentrations in yeast cytosol [Lagunas & Gancedo 
(1983) loc. cit], inhibition was so severe that even 10 mM F6P 

JO permitted only 40 % of the activity observed under standard 

conditions. Thus, expression of a truncated trehalose synthase 
in yeast would be expected to cause a large increase in the 
intracellular specific activity of the enzyme. 

5 Fructose-l-phosphate, f ructose-i, 6-bisphosphate, fructose-2, 6- 
bisphosphate and glucose-l-phosphate were test d at sub-optimal 
F6P concentrations (1 mM F6P at 1.3 mM phosphate). None caused 
activation at 5 or 2.5 mM concentrations; instead inhibitions 
of about 25 % occurred, probably due to competition with G6P 

0 and F6P. 
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fb) The TPP Partial Activity. 

At phosphate concentrations equal to or less than 1 mM, the 
progress curves of TPP reactions catalysed by truncated 
trehalose synthase accelerated markedly over at least the first 
10 min of reaction. This did not happen with intact enzyme. For 
the initial rates of reaction, intact enzyme was activated by 
smaller phosphate concentrations than was truncated. enzyme _ 
(Fig. 17) . For truncated enzyme, double-reciprocal plots of the 
activation (va = the rate with phosphate, v pf , minus the rate 
without phosphate, v 0 ) were linear when 1/va was plotted 
against 1/ [phosphate] , with a K ft of 3 mM phosphate. For intact 
enzyme these plots were non-linear, and linear plots resulted 
when 1/ [phosphate] 2 was used (Fig. 18). This, again, suggests 
that intact enzyme has two strong phosphate binding sites, one 
of which is lost in the truncated enzyme. For intact enzyme, 
half maximal activation was obtained at 0.6 mM phosphate. 

In the absence of phosphate, F6P did not affect the TPP 
activity of intact enzyme. At sub-optimal phosphate 
concentrations, 5 mM F6P caused modest (20 to 30 %) inhibitions 
of the TPP activity of both intact and truncated enzymes, and 
at saturating phosphate concentrations, smaller inhibitions (10 
to 15 %) were observed (data not shown) . 

These findings disclose a profound sensitivity of the TPS 
activity of intact trehalose synthase to physiological 
phosphate and F6P concentrations that is lost by truncation of 
the 123 kDa long chain to about 85 kDa. The effects of 
truncation are less marked on the TPP activities, both enzymes 
being activated by physiological phosphate concentrations, and 
neither showing a strong response to F6P. The data suggest that 
intact enzyme has two strong phosphate binding sites, one of 
which is located in the region of the 123 kDa long chain 
removed by truncation. The finding that the 123 kDa long chain 
could not be recovered from phosphocellulose, disclosed in 
Example 9 supports this conclusion. 
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Example 13 Expression of TPS activity in Escherichia coli cells 
transformed with TSS1 and TSLl 

> 

E.coli . strain HB101 (ALKO 683) was transformed with the 
5 plasmids pALK752 and pALK754 consisting of pBluescript 

containing TSS1 and TSLl, respectively (see Example 14 and Figs 
20a and 20c) . Transformants ALK03566 and ALK03568 containing, 
respectively , pA LK75 2 and^ pALK75 4 were .selected _and naa intained _ 
by growth in the presence of 50 fig/ml of ampicillin. Shake 

10 flasks containing Luria Broth with no ampicillin (ALKO 683) or 
15 fig/ml ampicillin (ALK03566 and ALK03568) were inoculated 
with 1 ml of a suspension (A600 = 1.5) of the appropriate cells 
and shaken at 250 rpm and 30 °C for 15 h. Cells were harvested 
(5 min and 3000g) , washed twice with water, suspended (1.5 g 

15 cells/3.7 ml) in HBMED containing 1 mM PMSF and 10 jzg/ml 

pepstatin A, and broken by two passes through a French press 
(Aminco) at 15 000 psi. Samples of the homogenates were 
centrifuged 20 min at 28 OOOg. Homogenates and supernatant s 
were assayed for TPS and TPP at once and the protein contents 

20 of the supernatants were determined (Table 10). 

Table 10. Expression of TPS activity in E. coli transformed with 
TSS1 and TSLl. 

25 Host (ALKO 683) and transformants (ALK03566 containing TSS1 and 
ALK03568 containing TSLl) were grown, harvested and broken as 
described in the text. Cell homogenates and supernatants were 
assayed at once for TPS, using the standard assay and a blank 
assay from which G6P and F6P were omitted, and for TPP. 

30 Activities are shown as mU/g fresh cells unless stated 
otherwise. 
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15 



20 



C$11 yield (g/200 ml) 

Homoaenates 
Standard TPS 
TPS Blank 
Net TPS 

Standard TPP 

Supernatants 
Standard TPS 
TPS Blank 
Net TPS 
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ALKO 683 
1.57 



361 ± 75 
363 ± 57 
0 ± 20 

1130 ± 70 



273 ± 73 
263 ± 21 
10 ± 50 



Net TPS (mtf/mg protein) 0.08 
Standard TPP 1130 ±100 



ALKQ3566 
1.51 



1065 ± 118 
227 ± 45 
840 ± 70 



699 ± 47 
155 ± 42 
540 ± 10 

4.50 

910 ± 90 



ALKQ3568 
1.56 



260 ± 23 
117 ± 77 
140 ± 50 



1110 ± 100 1190 ± 80 



233 ± 68 
135 ± 9 
100 ± 60 

0.87 

1020 ± 80 



25 



30 



35 



40 



Standard and blank TPS assays both showed accelerating progress 
curves and results in Table 10 are mean ± range of 5 min and 10 
min assays , which were handled separately to calculate - the net 
TPS activity. Essentially all of the standard TPS activity 
measured in the host cells and about half of that in ALK03568 
cells was due to a blank reaction (presumably a- phospho- 
diesterase) generating UDP from UDPG in the absence of G6P and 
F6P. The net TPS activity of host cells grown under these 
conditions was close to zero, whereas cells transformed with 
TSSl or TSL1 contained 840 or 140 mU/g fresh cells, most of 
which (64 % and 71 %, respectively) was soluble. Compared to 
the host preparation, the specific activities of the net TPS in 
the 28 000 g supernatants were increased about 50-fold 
(ALK03566) and 10-fold (ALK03568) . There are probably two 
reasons for the very low TPS activity of the host cells: 
trehalose-6 -phosphate synthase of E. coli is induced by high 
osmotic strength, and although some strains also acquire 
activity in stationary phase, the enzyme activity itself is 
strongly activated by higher (0.25 M) cation concentrations 
than in our assay conditions [Giaever et al (1988) Journal of 
Bacteriology 170, 2841-2849]. 
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No significant change in the TPP activities was observed. Host 
cells already contained 1100 mU/g of TPP measured in 25 mM 
phosphate (and more than 5 U/g measured in 25 mM Hepes buffer) . 
If transformation with the plasraids would have generated TPP 
5 activity with a TPP/TPS ratio the same as in pure trehalose 

synthase from yeast, then the increments in TPP (about 250 and 
40 mU/g for ALK03566 and ALK03568 r repectively) would have been 
undetectable^ f or MJC03568 ajtta clo se to the experimental error 
for ALK03566. 

0 

Western analyses (Fig/l9) showed that ALK03566 specifically 
expressed a 57 kDa' band recognized by anti-57K serum and more 
weakly reacting bands with smaller molecular masses. ALK03568 
specifically expressed bands recognized by anti-93K serum at 
5 about 60, 36 and 35 kDa (strong), suggesting that extensive 

degradation of the long chain occurs in ALK03568 or that TSL1 
is not correctly transcribed and translated. 

These results disclose (l) TPS activity can be transferred to 
) heterologous cells by either TSSl or TSL1, (2) a TSS1 gene 
product has TPS activity and (3) also one or more (degraded) 
products of TSLl has TPS activity. This latter finding is 
unexpected, because yeast containing a defective (Example 7) or 
disrupted (Example 14) TSSl gene lack TPS activity. Possibly 
» ALK03568 accumulates fortuitously degraded proteolytic products 
of the 123 kDa long chain of trehalose synthase that exhibit 
TPS activity even in the absence of the TSSl product. 

Obviously, transformation with TSSl (or TSLl) alone can be used 
to introduce a trehalose synthetic pathway to an organism, such 
as E. coli HB101, that already has the capacity to generate 
trehalose from trehalose-6-phosphate, possibly via a non- 
specific phosphatase. 
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Example 14. Trans formation of Yeast 

t 

fl) Assembly of complete genes and truncated versions of TSIil. 

Plasmids comprising the complete ORFs of TSS1 and TSL1 and a 
truncated ORF of TSL1 were assembled from appropriate 
immunopositive clones of the Haelll and EcoRI libraries used in 
Examples 4 and^ 5 to sequence these genes . 

fa) The TSS1 gene with its promoter (p AT. K 752) 

A 516 bp fragment was cut from Haelll clone 7 with restriction 
enzymes Dral and BstEII (see Fig 6 for restriction sites). The 
Dral site marks the beginning of the disclosed TSS1 sequence. 
This fragment was joined to Haelll clone 2 0 after this had been 
digested with BstEII and Clal (the Clal site was in the 
polylinker) and the Clal end filled with Klenow fragment. The 
sequence at the junction at the BstEII site in the religated 
plasmid (shown in Fig 20a) was confirmed by sequencing. 

fb) The TSS1 gene without its promoter (PALK7531 

Haelll clone 21 was cut with the restriction enzyme Tthllll. To 
this site the following linker (SEQ ID NO: 84, synthesized with 
the ABI DNA Synthesizer) was added: 

5'-CGGGAAGACA TAGAACTATG ACTACGGATA ACGCTAAGGC GCAACTGACC -3' 
3'-GCCCTTCTGT ATCTTGATAC TGATGCCTAT TGCGATTCCG CGTTGACTGG -5' 

This includes nucleotides -13 to +33 of TSS1 (see Fig. 4) but, 
when correctly orientated, introduces a Smal site at nucleotide 
-16 from the ATG start site. The plasmid (shown in Fig 20b) can 
be used to release with Smal the ORF of the TSS1 gene and about 
200 bp of its terminator for further constructions (e.g. 
expression vectors containing a new promoter) . 
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(c) The TSL1 gene with its promoter (PALK7541 

EcoRI clone 10 was cut with the restriction enzymes Mlul and 
Ndel, and the resulting 4.4 kb fragment was religated into the 
5 pBluescript Smal site. This procedure destroyed all these 
sites, so that these restriction enzymes cannot be used in 
further manipulations. The plasmid is shown in Fig 20c. 



(d) The TSL1 gene without its promoter f p AT.fC7 57) 

10 

Primers for the polymerase chain reaction (PCR) were made 
against the beginning of the TSLl gene and the sequence at 
+318. PCR (Techne PHC-2 Heat/Cool Dri -Block*) was used to 
synthesize (at 55 °C) a 325 bp fragment, which had at one end a 

15 Spel site and close to the other end a Bsml site. This fragment 
was digested with Bsml and can be ligated to pALK754 after 
cutting the latter with Spel (at the site in the pBluescript 
polylinker) and Bsml and filling the Spel site with Klenow 
fragment. For further manipulations, the gene can be isolated 

20 by cutting the resulting plasmid with Spel and, for example, 
Clal. 

(e) A truncated TSLl gene 

25 A truncated version of TSLl can be made by cutting pALK754 with 
Stul and joining the following linker (SEQ ID NO: 85) to this 
site: 

5 ' -GGGCCCAACA ACACAATGGT TACCCCGAAA TCGAGGGCGG GCAACAGG -3' 
3* ' — CCCGGGTTGT TGTGTTACCA ATGGGGCTTT AGCTCCCGCC CGTTGTCC -5' 

10 

The linker recreates the Stul site and creates a new ATG start 
codon at +627 in frame with the coding sequence. Thus, this 
version of the gene encodes a truncated 123 kDa long chain 
lacking the first 209 amino acids. It was disclosed in Example 
35 10 that removal of the first 325 or so amino acids proceeds 
without loss of catalytic activity, but releases trehalose 
synthase from strong inhibition by phosphate and the 
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requirement: for F6P. Hence, this construction can encode a 
truncated 123 kDa polypeptide leading to a trehalose synthase 
with increased activity at physiological phosphate and F6P 
concentrations. A new Smal site is included in the linker. The 
sequence flanking the new ATG on the 5 '-side resembles the 
original ATG flanking sequence and the surrounding nucleotides 
are in accordance with the sequences known to occur most 
_ frequently at pos itions -7 _to . +4 . _ _ _ 

(2) Disruption mutants . 

The TSS1 gene was disrupted to confirm that it is an essential 
gene in trehalose synthesis. The one-step gene disruption 
method [Rothstein, R.J. (1983) Methods in Enzymology 101, 202- 
211)] was used as. follows: 

Plasmid pALK752 was cut with Xcml. A blunted Sair-Xhol fragment 
containing the LEU2 gene from plasmid yEpl3 [Broach, J et al 
(1979) Gene 8 ,121-133] was ligated to the blunted Xcml site. 
The resulting plasmid was cut with Nsil and Pvul and the 
reaction mixture was run through a 0.8% low-melting point 
agarose gel. A band of 4 kb was excised from the gel and 
purified- S. cerevisiae strain S150-2B was transformed (using 
the one-step alkali-cation method of Chen et al [ (1992) Current 
Genetics 21, 83-84]) with the 4 kb DNA fragment containing the 
TSS1 gene interrupted by the LEU2 gene. Leu + transf ormants were 
selected on minimal plates lacking leucine and containing 
glucose or galactose, and the clones obtained were then grown 
on YPD or YP/2% galactose, respectively. 

As expected the phenotype of the disruptants resembled the fdpl 
and cifl phenotypes (see Example 2) . Only one transf ormant 
(ALK03569) was isolated on glucose and the several 
transf ormants isolated on galactose were unable to grow on 
glucose. The glucose transf ormant and the tested galactose 
transformant (ALKO3570) did not accumulate trehalose in 
stationary phase (£0.2 % of dry wt. ) , lacked TPS and had low 
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TPP activity (< 10 % of wild type) . The 57 kDa band could not 
be seen on Western blots. Southern analysis (Fig 21) showed 
that the TSS1 gene had been disrupted by a LEU2 gene, but the 
TSL1 gene was intact. 

5 

Another mutant, WDC-3A (see Table 1) with a disrupted TSS1 gene 
was obtained from the laboratory of Dr. C. Gancedo (Instituto 
de Investigaciones Biomedicas, CSIC, Madrid , Spain) as a 
cifl::HIS3 disruptant. This mutant was easier to transform than 

10 were the tssl::LEU2 disruptants, and so it was used to confirm 
that the TSS1 gene on a plasmid can confer TPS and TPP 
activities, trehalose accumulation and improved stress 
resistance. WDC-3A was transformed with the plasmid pMB4 (see 
Table 1; the plasmid contains an intact CIF1 s TSS1 gene and a 

15 selectable URA3 marker) and trans formants selected in the 

absence of uracil. Western analyses (not shown) indicated that 
the transfdrmants has acquired the 57 kDa band absent from WDC- 
3A. The parent and a transformant were grown in parallel in 
minimal medium containing 2 % galactose and (transformant) no 

20- uracil or (parent) uracil. Duplicate cultures of each strain 
were harvested in early stationary phase after 28 h growth 
samples taken for studies of stress resistance and the rest 
used for trehalose and enzyme assays (Table 11). 

15 Table 11. Analysis of WDC-3 A and its PMB14 transformant . 

Duplicate cultures were analyzed separately for trehalose and 
combined for en2yme assays. TPS activities were corrected for 
UDPGase activity in the absence of G6P and F6P . 
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Cell mass (g/100 ml medium) 
Trehalose (% of dry wt.) 
Whole homoaenates 
TPP (U/g fresh yeast) 
TPS (U/g fresh yeast) 
2B ±°?Q 5_ A u P e ™JM^nts 
TPP (U/g fresh yeast) 
TPS (U/G fresh yeast) 
TPS (mU/mg protein) 



WDC-3A 
ftssl: ;HIS31 
2.6 
0.84, 0.81 

0.02 

0.84 ± 0.37 



0.01 

0.22 ± 0.20 
4.0 



PMB14 fTSSl) 
Trans formant 
2.8 
2.9, 3.0 

0.84 
17.9 ± 3.5 

0.67 
14.6 ± 3.3 
223 



These results disclose that introduction of TSS1 on a plasmid 
can restore both TPS and TPP activities and increase the 
trehalose content of an organism. The TPP/TPS ratio (5 %) is 
much lower than that (about 35 %) of purified trehalose 
synthase whereas the baker' yeast used in Example 1 and the 
X2180 used in Example 2 both have TPP/TPS ratios in their 
homogenates close to that of pure enzyme. This suggests that 
transformation with TSS1 in pMB14 increases TPP only up to a 
limit set by the genetic background of the host (probably the 
amounts of 99 and 123 kDa polypeptides present) but causes a 
larger increase in TPS due to activity associated also with 57 
kDa chains not incorporated into the trehalose synthase 
complex. 

Samples of the transformant and host were frozen in water at 1 
fiq yeast/ml and kept for 5 days at -20 °C. The Viability was 
then tested on plates containing YP/2 % galactose. After 
freezing stress, 1.0 ±0.1 % of the transformants and < 0.05 % 
of the host cells were viable. These results disclose that 
transformation of an organism with TSSl can increase its 
resistance to freezing-stress. 
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(3) Strategies for transformation . 

Laboratory strains of S, cerevisiae bearing auxotrophic markers 
such as his3, leu2 f lys 2, trpl and ura3 can be easily 
5 transformed with the trehalose synthase genes by essentially 
the same methjods described for transformation of tssl 
disruptants with TSS1, Versions of the genes in which the 
natural promoters and terminators are intact or have been_ 
replaced by (stronger and regulatable) promoters and 
10 terminators from other yeast genes can be used. For example, 
PGKI [pMA91; Mellor et al (1983) Gene 24, 1-14], ADC1 [pAAH5; 
Ammerer (1983) Methods in Enzymology 101, 192-201] and MEL1 
[pALK3537, pALK41, etc., Suominen, P.L. (1988) Doctoral 
dissertation. University of Helsinki] systems have been used to 
15 increase the expression levels of genes in S. cerevisiae and 
other yeast. The MEL1 system has the advantage that the 
expression can be regulated, being repressed by glucose and" 
induced by galactose. Standard vectors are available [episomal 
and integrating and centromere yeast plasmids are reviewed by 
20 Rose & Broach (1990) Methods in Enzymology 185, 234-279 and 

Stearns, T. , Ma, H & Botstein, D. (1990) Methods in Enzymology 
185, 280-291] that incorporate auxotrophic markers such as 
HIS3, LEU2, TRP1 and URA3, which can be used to select the 
transf ormants . Vectors based on these principles, but suited 
25 to a particular task can be constructed by a person familiar 
with the art. 

The basic strategy is to leave the yeast with an intact version 
of its natural genes for trehalose synthase and introduce, 

30 either on episomes or integrated into a yeast chromosome, extra 
copies of the genes. These may be under control of their own 
promoters, or of stronger promoters and promoters that can be 
regulated, for example by adding substances such as galactose 
to the growth medium or by changing the temperature. The use of 

35 such promoters has been described [see, e.g., Mylin et al. 

(1990) Methods in Enzymology 185, 297-3 08; Sledziewski et al. 
(1990) Methods in Enzymology 185, 351-366]. This strategy 
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avoids problems that can be foreseen if all copies of the genes 
genes are put under tight control (such as the defects in sugar 

i 

catabolism expected if TSS1 is not properly expressed; see 
Example 7.) Transformed yeast bearing additional copies of the 
5 genes with their natural promoters may accumulate enough 

trehalose to exhibit the desired improvement in stability. They 
may also cycle enough glucose units through trehalose during 
fermentative conditions to generate an ATPase that accelerates 
fermentation and increases the yield of ethanol on glucose. 

10 Alternatively , the promoters of one or more genes can be 

changed to promoters that are more active under fermentative 
conditions. In another aspect of the invention, copies of the 
ORFs of the genes can be inserted into expression vectors 
equipped with powerful promoters (that may be regulatable) to 

15 cause still larger increases in trehalose. This can be 
particularly useful for the production of trehalose. 

Transforming yeast with two or all three genes can be achieved 
in several ways. The most obvious procedure is to use different 
20 auxotrophic markers and introduce the genes sequentially. 

Another method is to construct a Yip containing URA3 and a 
modified version of, say, TSLl with a stronger promoter but 
still containing a region of homology upstream of this 
promoter. After directed integration of this plasmid to the 

15 chromosomal ura3 site and selection of URA+ transf ormants , 
mutants in which the URA3 has again been excised (with a 
frequency of about 1 x 10" 4 ) can be selected by growth on media 
containing 5-f luoroorotic acid [see Stearns et al. (1990) loc. 
cit.]. Some of the selected cells would contain a new version 

0 of the gene, with the stronger promoter and can again be 

transformed, this time with, say, a modified TSS1 gene. The 
resultant transformants will contain one copy of TSLl driven by 
the new promoter, and two copies of TSS1, one of which is still 
under the control of its natural prompter. Thirdly, a Yip 

5 containing two or all three genes can be used to introduce the 
genes in a single step. 



WO 93/17093 



PCT/FI93/00049 



79 

Various methods to transform industrial, polyploid yeast, which 
lack auxotrophic markers have been described in the literature. 

i 

Earlier methods have been reviewed by Knowles, J.K.C. & Tubb, 
R.s. [(1987) E.B.C. symposium on brewer's yeast, Helsinki, 
5 1986. Monograph XII 169-185] and include the use of marker 
genes that confer resistance to antibibtics, methylglyoxal , 
copper, cinnamic acid and other compounds. These markers 
facilitate selection of transformants. Some_ of_ the marker _genes_ 
are themselves of yeast origin, and so are preferred for 

10 acceptability reasons. When suitable modifications and 
combinations of the genes have been identified by using 
laboratory yeast, they may be transferred to industrial yeast 
using these procedures or others described in the literature, 
such as co-transformation with pALK2 and pALK7 [Suominen, P. I. 

15 (1988) loc. cit.].. These plasmids contain a readily selectable 
MEL1 marker gene on a 2 /i-based plasmid that can readily be 
cured, thus facilitating sequential transformation with more 
than one gene if it is not practicable to introduce the 
modified genes in one step using this co-transformation 

20 procedure. 

Example 15. Transformation of crop plants . 

Methods for the transformation of higher plants, including crop 
25 plants of economic importance , have been described [Goodman et 
al (1987) Science 236, 48-54; Weising et al, (1988) Annual 
Review of Genetics 22 , 421-477; Glasser & Fraley (1989) Science 
244, 1293-1299; Lindsey, K. (1992) Journal of Biotechnology 26, 
1-28] and laboratory manuals setting, out standard procedures 
30 are available such as the Plant Molecular Biology Manual [ed. 
Gelvin & Schnilperoort (1988) Kluwer Academic Press]* Of 
particular utility is the use of tissue specific promoters from 
the genes of proteins that are expressed in a highly 
tissue-specific manner [see, e.g., Higgins (1984) Annual Beview 
35 of Plant Physiology 35, 191 et sea . ; Shotwell and Larkins 

(1989) in The Biochemistry of Plants 15, 297 et seq . 1 . The use 
of such promoters will allow the expression of trehalose 
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synthase in (a) specifically the frost- and drought-sensitive 
tissues of plants so that they may be protected from these and 
equivalent stresses without diverting carbohydrate metabolism 
in the major storage tissues, or alternatively (b) precisely in 
5 the edible tissues. The purpose of this second alternative is 
to cause the accumulation of trehalose in products such as the 
fruit of tomatoes, in order to increase the shelf- life of these 
products. The expression of non-plant genes, with higher A+T 
contents than are commonly found in plant genes can generally 

10 be improved by changing the codons to increase the G+C content, 
and in particular to avoid regions of overall high A+T content 
Perlak et al. (1991) loc. cit. ] . It is foreseen that such 
modifications will be beneficial in the case of the genes TSS1 
and TSL1, which have A+T-rich regions. Selection systems are 

15 available for use in the transformation of higher plants, 

including plasmids comprising the gene (hpt) for hygromycin 
phosphotransferase [Dale & Ow (1991) Proceedings of the 
National Academy of Sciences, USA 88, 1055810562] . These and 
similar methods familiar to persons skilled in the art can be 

!0 ■ used, first to introduce various modifications of the yeast 

trehalose synthase genes into Arabidopsis thaliana , and then to 
transfer the most successful modifications to plants of 
economic importance, 

5 One example of how one- would transform a crop plant (dicots and 
some monocots) is via a Ti plasmid. A large fragment of the Ti 
plasmid encompassing both the T-DNA and vir regions is first 
cloned into the common bacterial plasmid pBR322. One or more of 
the trehalose synthase genes are then cloned into a non- 

0 essential region of the T-DNA and introduced into Aorobacterium 
tumefaciens carrying an intact Ti plasmid. The plants are then 
infected with these bacteria and the gene products of the vir 
region on the intact Ti plasmid mobilize the recombinant T-DNA, 
and the recombinant T-DNA integrates into the plant geome. One 

5 or more of the trehalose synthase genes can be introduced into 
the plant in this manner, by inserting the genes into either 
the same plasmid or separate plasmids. 



WO 93/1 7093 



PCT/FI93/0Ofc;9 



81 

SEQUENCE LISTING 
(1) GENERAL INFORMATION 

5 

(i) APPLICANT: 

(A) NAME: ALKO LTD. (et al.) 

(B) STREET: Salmisaarenranta 7 

(C) CITY: Helsinki 
10 (D) COUNTRY: FINLAND 

(E) POSTAL CODE: SF-00100 

(ii) TITLE OF INVENTION: Increasing the trehalose content of 
15 organisms by transforming them with 

combinations of the structural genes 
for trehalose synthase. 



20 (iii) NUMBER OF SEQUENCES: 85 

(iv) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US / 07/836,021 

(B) FILING DATE: 14 February 1992 

(A) APPLICATION NUMBER: US / 07/841,997 

(B) FILING DATE: 28 February 1992 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2481 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Doublestranded 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI -SENSE: no 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: S288C 

(E) HAPLOTYPE: Haploid 

50 

(vii) IMMEDIATE SOURCE 

(A) LIBRARY: Genomic 

(B) CLONE: 20 

55 (viii) POSITION IN GENOME: 
(A) CHROMOSOME: 2R 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



i 

TTTTTAAACG TATATAGATG TCTACATGTG TGTTTTTGTT TTTTTACGTA 50 
5 CGTATACCCA CCTATATATG CATAATCCGT AATTGAAAAA AAAAAAAGTA 100 
AAGATCAAGG AACACATCAC CCTGGGCACA TCAAGCGTGA GGAATGCCGT 150 
CCAACTGGTG GAGACGCTTG ATTTGCTCTT TTTGTTCCTG GGTCCAACCC 200 
GGTCTCGAAG AACATCAGCA CCACGCCCGC AACGACAAAG AACATTGCAA 250 
TACACTTGCA TATGTGAGCA TAGTCGAGCG GTCCGTTCTG TGGTTGATGC 300 
10 TGTTGTTCTT TCTTCTGTTT GTCAGGGGTG ATAGCCATAT CTTCGTGCTC 350 
TTGTTGCGAT TGTTCTGTTC CATCTGCACC AGAACAAAGA ACAAAAGAAC 4O0 
AAGGAACAAA GTCCAAGCAC GTCAGCGCTG TTTATAAGGG GATTGACGAG 450 
GGATCGGGCC TAGAGTGCCA GCGCGCCAGG GAGAGGGAGC CCCCTGGGCC 500 
CTCATCCGCA GGCTGATAGG GGTCACCCCG CTGGGCAGGT CAGGGCAGGG 550 
15 GCTCTCAGGG GGGCGCCATG GACAAACTGC ACTGAGGTTC TAAGACACAT 6O0 
GTATTATTGT GAGTATGTAT ATATAGAGAG AGATTAAGGC GTACAGCCGG 650 
GTGGTAGAGA TTGATTAACT TGGTAGTCTT ATCTTGTCAA TTGAGTTTCT 700 
GTCAGTTTCT TCTTGAACAA GCACGCAGCT AAGTAAGCAA CAAAGCAGGC 750 
TAACAAACTA GGTACTCACA TACAGACTTA TTAAGACATA GAACTATGAC 800 
20 TACGGATAAC GCTAAGGCGC AACTGACCTC GTCTTCAGGG GGTAACATTA 850 
TTGTGGTGTC CAACAGGCTT CCCGTGACAA TCACTAAAAA CAGCAGTACG 900 
GGACAGTACG AGTACGCAAT GTCGTCCGGA GGGCTGGTCA CGGCGTTGGA 950 
AGGGTTGAAG AAGACGTACA CTTTCAAGTG GTTCGGATGG CCTGGGCTAG 100 0 
AGATTCCTGA CGATGAGAAG GATCAGGTGA GGAAGGACTT GCTGGAAAAG 1050 
25. TTTAATGCCG TACCCATCTT CCTGAGCGAT GAAATCGCAG ACTTACACTA 1100 * 
CAACGGGTTC AGTAATTCTA TTCTATGGCC GTTATTCCAT TACCATCCTG 1150 
GTGAGATCAA TTTCGACGAG AATGCGTGGT TCGGATACAA CGAGGCAAAC 1200 
CAGACGTTCA CCAACGAGAT TGCTAAGACT ATGAACCATA ACGATTTAAT 1250 
CTGGGTGCAT GATTACCATT TGATGTTGGT TCCGGAAATG TTGAGAGTCA 1300 
30 AGATTCACGA GAAGCAACTG CAAAACGTTA AGGTCGGGTG GTTCCTGCAC 1350 
ACACCATTCC CTTCGAGTGA AATTTACAGA ATCTTACCTG TCAGACAAGA 140 0 
GATTTTGAAG GGTGTTTTGA GTTGTGATTT AGTCGGGTTC CACACATACG 1450 
ATTATGGAAG ACATTTCTTG TCTTCCGTGC AAAGAGTGCT TAACGTGAAC 1500 
ACATTGCCTA ATGGGGTGGA ATACCAGGGC AGATTCGTTA ACGTAGGGGC 1550 
35 CTTCCCTATC GGTATCGACG TGGACAAGTT CACCGATGGG TTGAAAAAGG 1600 
AATCCGTACA AAAGAGAATC CAACAATTGA AGGAAACTTT CAAGGGCTGC 1650 
AAGATCATAG TTGGTGTCGA CAGGCTGGAT TACATCAAAG GTGTGCCTCA 1700 
GAAGTTGCAC GCCATGGAAG TGTTTCTGAA CGAGCATCCA GAATGGAGGG 1750 
GCAAGGTTGT TCTGGTACAG GTTGCAGTGC CAAGTCGTGG AGATGTGGAA 1800 
40 GAGTACCAAT ATTTAAGATC TGTGGTCAAT GAGTTGGTCG GTAGAATCAA 1850 
CGGTCAGTTC GGTACTGTGG AATTCGTCCC CATCCATTTC ATGCACAAGT 1900 
. CTATACCATT TGAAGAGCTG ATTTCGTTAT ATGCTGTGAG CGATGTTTGT 1950 
TTGGTCTCGT CCACCCGTGA TGGTATGAAC TTGGTTTCCT ACGAATATAT 2000 
TGCTTGCCAA GAAGAAAAGA AAGGTTCCTT AATCCTGAGT GAGTTCACAG 2050 
45 GTGCCGCACA ATCCTTGAAT GGTGCTATTA TTGTAAATCC TTGGAACACC 2100 
GATGATCTTT CTGATGCCAT CAACGAGGCC TTGACTTTGC CCGATGTAAA 2150 
GAAAGAAGTT AACTGGGAAA AACTTTACAA ATACATCTCT AAATACACTT 2200 
CTGCCTTCTG GGGTGAAAAT TTCGTCCATG AATTATACAG TACATCATCA 2250 
AGCTCAACAA GCTCCTCTGC CACCAAAAAC TGATGAACCC GATGCAAATG 2 300 
50 AGACGATCGT CTATTCCTGG TCCGGTTTTC TCTGCCCTCT CTTCTATTCA 2 350 
CTTTTTTTAT ACTTTATATA AAATTATATA AATGACATAA CTGAAACGCC 2400 
ACACGTCCTC TCCTATTCGT TAACGCCTGT CTGTAGCGCT GTTACTGAAG 2450 
CTGCGCAAGT AGTTTTTTCA CCGTATAGGC C 2481 
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(3) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 495 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: Polypeptide 
10 (iii) HYPOTHETICAL: Yes 

— (xi) -SEQUENCE DESCRIPTION: SEQ ID~ NO: 2 



15 Met Thr Thr Asp Asn Ala Lys Ala Gin Leu Thr Ser Ser Ser Gly 

5 10 15 

Gly Asn lie He Val Val Ser Asn Arg Leu Pro Val Thr He Thr 

20 25 30 

Lys Asn Ser Ser Thr Gly Gin Tyr Glu Tyr Ala Met Ser Ser Gly 
20 35 40 45 

Gly Leu Val Thr Ala Leu Glu Gly Leu Lys Lys Thr Tyr Thr Phe 

50 55 60 

Lys Trp Phe Gly Trp Pro Gly Leu Glu He Pro Asp Asp Glu Lys 

65 70 75 

25 Asp Gin Val Arg Lys Asp Leu Leu Glu Lys Phe Asn Ala Val Pro 

80 85 90 

lie Phe Leu Ser Asp Glu He Ala Asp Leu His Tyr Asn Gly Phe 

95 100 105 

Ser Asn Ser He Leu Trp Pro Leu Phe His Tyr His Pro Gly Glu 
30 110 115 120 

He Asn Phe Asp Glu Asn Ala Trp Phe Gly Tyr Asn Glu Ala Asn 

125 130 135 

Gin Thr Phe Thr Asn Glu He Ala Lys Thr Met Asn His Asn Asp 

140 145 150 

5 Leu He Trp Val His Asp Tyr His Leu Met Leu Val Pro Glu Met 

155 160 165 

Leu Arg Val Lys He His Glu Lys Gin Leu Gin Asn Val Lys Val 

170 175 180 

Gly Trp Phe Leu His Thr Pro Phe Pro Ser Ser Glu He Tyr Arg 
0 185 190 195 

lie Leu Pro Val Arg Gin Glu He Leu Lys Gly Val Leu Ser Cys 

200 205 210 

Asp Leu Val Gly Phe His Thr Tyr Asp Tyr Ala Arg His Phe Leu 

215 220 225 

5 Ser Ser Val Gin Arg Val Leu Asn Val Asn Thr Leu Pro Asn Gly 

230 235 240 

Val Glu Tyr Gin Gly Arg Phe Val Asn Val Gly Ala Phe Pro He 

245 250 255 

Gly He Asp Val Asp Lys Phe Thr Asp Gly Leu Lys Lys Glu Ser 
0 260 265 270 

Val Gin Lys Arg He Gin Gin Leu Lys Glu Thr Phe Lys Gly Cys 

275 280 285 

Lys He He Val Gly Val Asp Arg Leu Asp Tyr He Lys Gly Val 

290 295 300 

5 Pro Gin Lys Leu His Ala Met Glu Val Phe Leu Asn Glu His Pro 

305 310 315 
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10 



15 



20 



25 



Glu Trp 
Arg Gly 
Glu Leu 
Val Pro 
lie Ser 
Arg Asp 
Glu Glu 
Ala Gin 
Asp Asp 
Val Lys 
Lys Tyr 
Tyr Ser 



Arg Gly 
Asp Val 
Val Gly 
He His 
Leu Tyr 
Gly Met 
Lys Lys 
Ser Leu 
Leu Ser 
Lys Glu 
Thr Ser 
Thr Ser 



Lys 

320 

Glu 

335 

Arg 

350 

Phe 

365 

Ala 

380 

Asn 

395 

Gly 

410 

Asn 

425 

Asp 

440 

Val 

455 

Ala 

470 

Ser 

485 



Val 
Glu 
He 
Met 
Val 
Leu 
Ser 
Gly 
Ala 
Asn 
Phe 
Ser 



Val Leu 
Tyr Gin 
Asn Gly 
His Lys 
Ser Asp 
Val Ser 
Leu He 
Ala He 
He Asn 
Trp Glu 
Trp Gly 
Ser Thr 



Val 

Tyr 

Gin 

Ser 

Val 

Tyr 

Leu 

He 

Glu 

Lys 

Glu 

Ser 



Gin 

325 

Leu 

340 

Phe 

355 

He 

370 

Cys 

385 

Glu 

400 

Ser 

415 

Val 

430 

Ala 

445 

Leu 

460 

Asn 

475 

Ser 

490 



Val Ala 
Arg Ser 
Gly Thr 
Pro Phe 
Leu Val 
Tyr He 
Glu Phe 
Asn Pro 
Leu Thr 
Tyr Lys 
Phe Val 
Ser Ala 



Val 

Val 

Val 

Glu 

Ser 

Ala 

Thr 

Trp 

Leu 

Tyr 

His 

Thr 



Pro 
Val 
Glu 
Glu 
Ser 
Cys 
Gly 
Asn 
Pro 
He 
Glu 
Lys 



Ser 

330 

Asn 

345 

Phe 

360 

Leu 

375 

Thr 

390 

Gin 

405 

Ala 

420 

Thr 

435 

Asp 

450 

Ser 

465 

Leu 

480 

Asn 

495 



(4) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS 
30 (A) LENGTH: 3 000 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: Doublestranded 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cere vis iae 

(B) STRAIN: S288C 

(E) HAPL0TYPE: Haploid 

45 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Genomic 

(B) CLONE: 6 

50 (vii) POSITION IN GENOME 
(A) CHROMOSOME: 13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



55 
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CCTCCTCTGG 
ACGCCCCACT 
ACAGCCTTCT 
AGACGTCTTC 
5 CTGACGAGGA 
AGGTTCAAAG 
TAAGGTCATC 
TCTATCAAAG 
GGAGAATATC 
10 TCCCAACGGA 
TTAAAAGACA 
CAAAGCCGCA 
ATTACCAGAT 
TGGAAGTTCT 
15 AATCTATAAG 
TGGTTCCGCA 
ACCTTACATG 
GCGTGAGAAG 
AGACGAGGGA 
20 ATGGCGGACG 
TTCTGTGAGG 
AATTGAAGGA 
TGGCAAGGGA 
AGGTATTCAC 
25 CGGAATACGT 
AAGGATGTAG 
CTCGCTATCC 
AAGATCTAGA 
TTCGTAGTCA 
30 TATCGTTTGT 
CTGGTAGTGC 
GATACCAAGA 
CGATAAGAGA 
ACGACTCTAC 
35 TGGCAATTCA 
ACTGATGGAA 
TTGCTGAACC 
TCTAAGGGCA 
GGAAAATCTT 
40 GTGCATACGT 
GATTGGCGTA 
ACCTGGCTCG 
AAAATGCGGA 
ACACATATCA 
*5 CAAAAACGTT 
AATTTCTTTT 
TCCGGCCAAA 
TCAAGAACAA 
ATTTCGCATG 
AAATTGGTCA 
CATTGTTTAT 
GGTTAAACGA 
TTACCATTTT 
TGCTTATTAT 
ATTATAAAAT 
ATGGGTAGCT 



ATCTTCTGGG 
TGACTGCGTC 
AA^TCTGAAAT 
GCAGCATAAT 
TATGTTTCTG 
TTCGGCGCTA 
TCAGGAGCTG 
GTAATGGCGC 
ATTCCGCACC 
TGAGATTCCG 
AGTACGACTC 
TACAAAAACT 
TCCAGACAAT 
ATAGAAACTT 
AAAGGTGACA 
GATGGTGAGA 
TCTCGTTCCC 
ATCTTAGAAG 
GTATGCAAGA 
TGGTACATGA 
TTCACCCCAG 
TGGAAGTGTC 
AAAAACTAAT 
AAGAAATTGT 
GGAAAAATCG 
AACTGGAGCG 
ACCAATATTA 
TTTTTCTCAG 
GCTCTCTAAG 
TCTGAGGACA 
ATCTTTATTG 
ACTTCTCACA 
AGGCCACAGT 
AAACTGGATC 
ATCAAGAAGG 
GATTACCAGT 
ACCTTCATCG 
ATATCGTTTA 
TACAGTCGTG 
TAGTCTGAAC 
ACGATGTAGC 
TACTACAAGA 
AGATCAAGAT 
ATACTGTTTT 
GTTTCCGTAC 
CAGATTCTAT 
TCACAAATAT 
CAACCTCCAG 
TGTCTCTGGT 
ATGATGAAGC 
GGTGATGCTA 
ACTTTTCACG 
AAAATTTTAA 
ATATCAATTC 
TCATCCTCTT 
ATTATTCATT 



TCTTCTGCGC 
TGCTGCAAAA 
ATTCGGAGTT 
GAGTCGGACC 
ATTAGGAATT 
TTCATAAATC 
TTTAGCCGTC 
CATGAAGAAC 
GTCATGTTAA 
GAAAATATCC 
CTATCCTGTC 
AGTGTAAACA 
CCGAACTCGA 
AAACCAAAGG 
CCATCTGGAT 
GACGTCTTGC 
CAGTAGTGAA 
GCTTGACCGG 
CATTTCTTAC 
TGAAGAGCTA 
TTGGTATCGA 
ATGCAATGGC 
TGTGTGTCGT 
TGGCTTATGA 
ACTTTAATTC 
CCAGATCATG 
GTATTTCTCA 
TATTTAGCTT 
GGAAGGTATG 
AAAATGCTCC 
AATGATGGCG 
AGCCATTCTC 
GGAAGAAATT 
AAGACTTCTT 
TTCCAAGATC 
CATCTAAAAA 
AGAATGATTT 
CATCATGAAC 
TGCAAAACAT 
GGTGTATGGT 
CAAAATTCTC 
TAAATGAGTC 
CGTGTAGCTA 
TGACCACAGA 
AACAAGTGGG 
AATTCTGCTT 
TCAGACACCA 
CCTCTCCCAC 
TCATCGTCTC 
AAGTGAAGGG 
CTTCTACTTA 
ATCATTTCAA 
TGTTCTTGGG 
TATAAATTTT 
ATTTATTACA 
TTTGCTTCGT 



CACCTTCCAT 
CAGCGTCCCT 
AGCAGATATT 
CGGATGATCT 
GATGACGCGA 
AACTAAGAAA 
TTCCATGGTC 
GCCATAAACA 
GTGGGTCGGT 
TTGCGAACAT 
CTTACGGACG 
AATCTTGTGG 
AGGCTTTTGA 
TTTGCGGACG 
TCATGATTAC 
CTTTTGCCAA 
GTGTTTAGGT 
TGCAGACTTT 
AGACGTCTAA 
AAGTATAACG 
CGCCTTTGAT 
GTCAATTGAT 
GATCAATTCG 
AAAATTCTTG 
AAATCTGTAT 
ATTGTCGTGG 
ACCTGTGGTG 
TGAGTTCAGA 
AACTTGACAT 
CCTACTGTTG 
CTATAATAAT 
AAGGGGTTGG 
GATGAAAGAC 
TACAAGATAT 
TTCAAATTGA 
GCGTATGTTT 
CCATACTGAA 
TCATTTCCAA 
TGGGTTGATT 
ACAACATTGT 
GAGGACAAAG 
CATGATCAAG 
GTGTTATCGG 
GGTATTCATG 
ACTTTCCTTA 
CGGATCCACT 
TCTCAACAAA 
TGTGTCGATG 
CTGTGCTTGA 
CAAGTAAAAG 
TGCCAAAGAA 
GAATCATTGA 
TATGAACTTT 
TTTCTTCTCT 
GCATCTTATA 
AAGGACTTTT 



TAAAAGGATT 50 
TATTGGCTAA 1 00 
TCGTCGAGTG 150 
AACTACTGCC 2 00 
GAGGACTACA 250 
TATGCGCTGT 3 00 
GATCGTTCCC 350 
CTGCAGTCTT 400 
ACCGTCGGAA 450 
CTCTGACTCT 500 
ACGACACCTT 550 
CCTACGCTGC 6O0 
AGATCACTCT 650 
CGATCGTTAA 7O0 
CATTTAATGC 750 
AATAGGATTT 8O0 
GTCTGGCTCA 850 
GTCGGCTTCC 9O0 
CCGTCTGCTA 950 
GCAGAGTCGT 1 0 O 0 
TTGCAATCGC 1050 
TCGTGAAAGA 11O0 
ATAGAATTAG 1150 
GTCGAAAATC 12 OO 
TGGAAGCAGT 1250 
ATAGAATCAA 13 O0 
TTTTTGCATC 1350 
GGCAGATTTG 140 0 
GTCACGAATT 1450 
TCAGAATTTA 15O0 
TAACCCATGG 1550 
AGATGCCATT 16O0 
ATTATCAACA 1650 
TCATATTTCG 17 O0 
ATACAAAAAC 1750 
GTTTTCAACA 18O0 
TGACATGACT 1850 
AGCCCATTCT 1900 
GCCGAGAATG 1950 
TGATCAAGTC 2 0O0 
TGGAGAGATT 2 050 
TTCCACACTG 2100 
TGATGCCATC 2150 
CCTACGTTTA 2200 
TCGGCAGCTC 2250 
GGATACGAGT 2300 
ATCCTTCAGA 2350 
AACCATATTG 2 400 
ACCATTGTrC 2450 
CCGGACACGC 2500 
CATGTAAATG 2550 
AGATTAAATT 2 600 
TATTTTCAAC 2 650 
CTAACGACCA 2700 
CATTATGTAT 2750 
TTTGTCAACT 2 80 0 
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TTTTCATCCT AAGCGGCTAA AAGTGATTGG AGAGGAATGT CCAGGCGACC 2850 

AATGATAAAA ACGCTTTCTC TTGGAACAAG AAATAGGAGC AATTGACAGT 29O0 

TGTCGATGAA CAGCGAAAAT AGTAAGATAA CCTTCAAGCC CAATATTCTA 2950 

ATTAAAGGCG TTTATATATT TGTACTTTAT GGTATGTGCA TATGTATTGT 3 000 



(5) INFORMATION FOR SEQ ID NO: 4 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 amino acids 

. _ (B)_ TYPE: -Amino acid 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULAR TYPE: Polypeptide 

(iii) HYPOTHETICAL : Yes 

(v) FRAGMENT TYPE: C- terminal 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 



25 



30 



35 



40 



45 



50 



55 



Arg Gly Leu Gin 
Lys Lys Tyr Ala 
Leu Pro Trp Ser 
Lys Asn Ala lie 
Arg His Val Lys 
lie Pro Glu Asn 
Lys Tyr Asp Ser 
Ala Ala Tyr Lys 
His Tyr Gin lie 
His Ser Trp Lys 
Ala lie Val Lys 
Asp Tyr His Leu 
Pro Phe Ala Lys 
Ser Glu Val Phe 
Gly Leu Thr Gly 
Ala Arg His Phe 
Val Val His Asp 



Gly 


Ser 


Lys Phe Gly Ala 


He His Lys Ser Thr 


5 




10 


15 


Leu 


Leu 


Arg Ser Ser Gin 


Glu Leu Phe Ser Arg 


20 




25 


30 


He 


Val 


Pro Ser He Lys 


Gly Asn Gly Ala Met 


35 




40 


45 


Asn 


Thr 


Ala Val Leu Glu 


Asn He He Pro His 


50 




55 


60 


Trp 


Val 


Gly Thr Val Gly 


He Pro Thr Asp Glu 


65 




70 


75 


He 


Leu 


Ala Asn lie Ser 


Asp Ser Leu Lys Asp 


80 




85 


90 


Tyr 


Pro 


Val Leu Thr Asp 


Asp Asp Thr Phe Lys 


95 




100 


105 


Asn 


Tyr 


Cys Lys Gin He 


Leu Trp Pro Thr Leu 


110 




115 


120 


Pro 


Asp 


Asn Pro Asn Ser 


Lys Ala Phe Glu Asp 


125 




130 


135 


Phe 


Tyr 


Arg Asn Leu Asn 


Gin Arg Phe Ala Asp 


140 




145 


150 


He 


Tyr 


Lys Lys Gly Asp 


Thr He Trp He His 


155 




160 


165 


Met 


Leu 


Val Pro Gin Met 


Val Arg Asp Val Leu 


170 




175 


180 


He 


Gly 


Phe Thr Leu His 


Val Ser Phe Pro Ser 


185 




190 


195 


Arg 


Cys 


Leu Ala Gin Arg 


Glu Lys He Leu Glu 


200 




205 


210 


Ala 


Asp 


Phe Val Gly Phe 


Gin Thr Arg Glu Tyr 


215 




220 


225 


Leu 


Gin 


Thr Ser Asn Arg 


Leu Leu Met Ala Asp 


230 




235 


240 


GlU 


Glu 


Leu Lys Tyr Asn 


Gly Arg Val Val Ser 
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10 



15 



20 



25. 



30 



35 



40 



45 



50 



55 



245 250 255 

Val Arg Phe Thr Pro Val Gly lie Asp Ala Phe Asp Leu Gin Ser 

260 265 , 270 

Gin Leu Lys Asp Gly Ser Val Met Gin Trp Arg Gin Leu He Arg 

275 280 285 

Glu Arg Trp Gin Gly Lys Lys Leu He Val Cys Arg Asp Gin Phe 

290 295 300 

Asp Arg He Arg Gly He His Lys Lys Leu Leu Ala Tyr Glu Lys 

305 310 315 

Phe Leu Val Glu Asn Pro Glu Tyr Val Glu Lys Ser Thr Leu He 

320 325 330 

Gin He Cys He _G.ly_Ser- Ser. Lys Asp Val Glu Leu Glu Arg Gin 

335 340 345 

He Met He Val Val Asp Arg He Asn Ser Leu Ser Thr Asn He 
350 355 360 

Ser He Ser Gin Pro Val Val Phe Leu His Gin Asp Leu Asp Phe 
365 370 375 

Ser Gin Tyr Leu Ala Leu Ser Ser Glu Ala Asp Leu Phe Val Val 
380 385 390 

Ser Ser Leu Arg Glu Gly Met Asn Leu Thr Cys His Glu Phe He 
395 400 405 

Val Cys Ser Glu Asp Lys Asn Ala Pro Leu Leu Leu Ser Glu Phe 

410 415 420 

Thr Gly Ser Ala Ser Leu Leu Asn Asp Gly Ala He He He Asn 

425 4.30 435 ' 

Pro Trp Asp Thr Lys Asn Phe Ser Gin Ala He Leu Lys Gly Leu 
440 445 450 

Glu Met Pro Phe Asp Lys Arg Arg Pro Gin Trp Lys Lys Leu Met 
455 460 465 

Lys Asp He He Asn Asn Asp Ser Thr Asn Trp He Lys Thr Ser 
470 475 480 

Leu Gin Asp He His He Ser Trp Gin Phe Asn Gin Glu Gly Ser 
485 490 . 495 

Lys He Phe Lys Leu Asn Thr Lys Thr Leu Met Glu Asp Tyr Gin 
500 505 510 

Ser Ser Lys Lys Arg Met Phe Val Phe Asn He Ala Glu Pro Pro 
515 520 525 

Ser Ser Arg Met lie Ser He Leu Asn Asp Met Thr Ser Lys Gly 
530 535 540 

Asn He Val Tyr He Met Asn Ser Phe Pro Lys Pro lie Leu Glu 
545 550 555 

Asn Leu Tyr Ser Arg Val Gin Asn He Gly Leu He Ala Glu Asn 
560 565 570 

Gly Ala Tyr Val Ser Leu Asn Gly Val Trp Tyr Asn He Val Asp 
575 580 585 

Gin Val Asp Trp Arg Asn Asp Val Ala Lys He Leu Glu Asp Lys 
590 595 600 

Val Glu Arg Leu Pro Gly Ser Tyr Tyr Lys He Asn Glu Ser Met 

605 610 615 

lie Lys Phe His Thr Glu Asn Ala Glu Asp Gin Asp Arg Val Ala 
620 625 €30 

Ser Val He Gly Asp Ala He Thr His He Asn Thr Val Phe Asp 
635 640 645 

His Arg Gly He His Ala Tyr Val Tyr Lys Asn Val Val Ser Val 

650 655 660 

Gin Gin Val Gly Leu Ser Leu Ser Ala Ala Gin Phe Leu Phe Arg 
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665 670 675 

Phe Tyr Asn Ser Ala Ser Asp Pro Leu Asp Thr Ser Ser Gly Gin 

680 685 690 

lie Thr Asn lie Gin Thr Pro Ser Gin Gin Asn Pro Ser Asp Gin 
5 695 700 705 

" Glu Gin Gin Pro Pro Ala Ser Pro Thr Val Ser Met Asn His lie 

710 715 720 

Asp Phe Ala Cys Val Ser Gly Ser Ser Ser Pro Val Leu Glu Pro 

725 730 735 

10 Leu Phe Lys Leu Val Asn Asp Glu Ala Ser Glu Gly Gin Val Lys 

740 745 750 

Ala Gly-His Ala-Ile- Val Tyr Gly Asp-Ala Thr-Ser Thr Tyr- Ala - 

755 760 765 

Lys Glu His Val Asn Gly Leu Asn Glu Leu Phe Thr lie lie Ser 
15 770 775 780 

Arg lie lie Glu Asp 

785 



20 (6) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS :* 

(A) LENGTH: 4 aiaino acids 

(B) TYPE: Amino acid 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
35 Tyr lie Ser Lys 



(7) INFORMATION FOR SEQ ID NO: 6 

40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 (ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-tenainal 

50 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Asp Val Glu Glu Tyr Gin Tyr Leu Arg 

5 
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( 8 ) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 
5 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(i-v) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

15 

His Phe Leu Ser Ser Val Gin Arg 

5- 



20 (9) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino acid 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
(V). SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Val Leu Asn Val Asn Thr Leu Pro Asn Gly Val Glu Tyr Gin 

5 10 



(10) INFORMATION FOR SEQ ID NO: 9 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Ser Val Val Asn Glu Leu Val Gly Arg 

5 
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(11) INFORMATION FOR SEQ ID NO: 10 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 
5 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

10 (iii) HYPOTHETICAL: No 

f -iv-)- FRAGMENT -TYPE :- N— terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Glu Thr Phe Lys 



(12) INFORMATION FOR SEQ ID NO: 11 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D)- TOPOLOGY: Linear 

25 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

Leu Asp Tyr lie Lys 
35 5 



(13) INFORMATION FOR SEQ ID NO: 12 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 (ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

50 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

lie Leu Pro Val Arg 

5 

55 
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(14) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

( iy) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Glu Val Asn Xaa Glu Lys 

5 



(15) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Phe Tyr Asp Xaa Xaa 

5 



(16) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Leu Xaa Ala Met Glu Val Phe Leu Asn Glu Xaa Pro Glu 

5 10 
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(17) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
5 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) - FRAGMENT TYPE: N-terminal •- - 

(V) SEQUENCE DESCRIPTION: SEQ. ID NO: 16 

15 

Tyr Thr Ser Ala Phe Trp Gly Glu Asn Phe Val Xaa Glu Leu 

5 10 



20 (18) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

35 Phe Gly Xaa Pro Gly Leu Glu lie Pro 

5 



(19) INFORMATION FOR SEQ ID NO: 18 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Xaa Gly Ser Val Met Gin 
55 5 
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(20) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

■-- (iv-)- FRAGMENT TYPE:- N- terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

Leu Pro Gly Ser Tyr Tyr Lys 

5 



(21) INFORMATION FOR SEQ ID NO: 2 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal . 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Asp Ala He Val Val Asn Pro Met Asp Ser Val Ala 

5 io 



(22) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Met He Ser He Leu 

5 
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(23) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 
5 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE : N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

15 

Arg Arg Pro Gin Trp Lys 

5 



20 (24) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 5 amino acids 

(B) TYPE: Amino acid 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

35 Ser Xaa Pro Gin Lys 

5 



(25) INFORMATION FOR SEQ ID NO: 24 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Phe Tyr Arg Asn Leu Asn Gin Arg Phe Ala Asp Ala lie Val Lys 
55 5 10 15 
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(26) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Asp Gly Ser Val Met Gin Xaa Xaa Gin Leu Xaa 

5 10 



(27) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 18 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Asn Ala lie Asn Thr Ala Val Leu Glu Asn lie lie Pro Xaa Xaa' 

5 10 15 

Xaa Val Lys 



(28) INFORMATION FOR SEQ ID NO:27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

Leu Val Asn Asp Glu Ala Ser Glu Gly Gin Val Lys 

5. 10 
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(29) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

10 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

15 (v) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Xaa Gin Asp lie Leu Leu Asn Asn Thr Phe Xaa 

5 10 

20 

(30) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 
25 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
30 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

35 

Asp Thr Thr Gin Thr Ala Pro Val Xaa Asn Asn Val Xaa Pro 

5 10 



40 (31) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

55 Asn Gin Leu Asp Ala Xaa Asn Tyr Ala Glu Val 

5 10 
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(32) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: Yes 



(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

Asn Leu Ser Arg Trp Arg Asn Tyr Ala Glu 

5 io 



(33) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 4 amino- acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: Yes 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
Trp Gin Gly Lys 



(34) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(D) TOPOLOGY: Linear v 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

lie Gin Leu Gly Glu Ser Asn Asp Asp Xaa Xaa 

5 10 
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(35) INFORMATION FOR SEQ ID NO: 34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 
5 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-tenainal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

15 

Gin Val Pro Thr lie Gin Asp Xaa Thr Asn Lys 

5 10 



20 (36) INFORMATION FOR SEQ ID NO: 35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N- terminal • 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

35 He Tyr Xaa Tyr Val Lys 

5 



(37) INFORMATION FOR SEQ ID NO: 36 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N- terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

Asn Gin Leu Gly Asn Tyr 
55 5 
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{38) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 4 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

<ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE :"N^ terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 37 
Val Ala Leu Thr 

(39) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 38 

Asp Ala lie Val Val Asn Pro Xaa Asp Ser Val Ala 

5 10 

(40) INFORMATION FOR SEQ ID NO: 39 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 39 

Thr Phe Thr Asn Tyr Asp Gly Ser Lys 

5 
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(41) INFORMATION FOR SEQ ID NO: 40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

_ "(iv)" FRAGMENT TYPE": "N- terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 40 

15 

Thr Gly Asn Asp Pro. Ser His lie Ala Lys 

5 10 



20 (42) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 41 

35 He Tyr Glu Ser Gin Gly Lys 

5 



(43) INFORMATION FOR SEQ ID NO: 42 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 42 

Ala Glu Gly Ala Thr Gly Gly Leu Val Pro His Lys 
55 5 10 
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(44) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(-i-v-)- FRAGMENT-TYPE :-N-termina-l - 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 43 

Leu Ala Thr Glu Leu Pro Ala Xaa Ser Lys 

5 10 

(45) INFORMATION FOR SEQ ID NO: 44 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 44 

Ser Leu Leu Asp Ala Gly Ala Lys 

5 

(46) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 45 

Glu Lys Pro Gin Asp Leu Asp Asp Asp Pro Leu Tyr Leu Thr 
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(47} INFORMATION FOR SEQ ID NO: 46 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL : No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 46 

15 

Xaa Gin Xaa His Gin Asp Xaa Xaa Asn Leu Thr 

5 10 



20 (48) INFORMATION FOR SEQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: ■ Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL : No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 47 

35 Ptie Asn Asp Glu Ser lie lie lie Gly Tyr Phe Xaa Xaa Ala Pro 

5 10 15 



(49) INFORMATION FOR SEQ ID NO: 48 
40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 48 

Ser Arg Leu Phe Leu Phe Asp Tyr Asp Gly Thr Leu Thr Pro 
55 5 10 
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(50) INFORMATION FOR SEQ ID NO:49 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

( iv) -FRAGMENT- TYPE: - N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 49 

15 

Gin Leu Gly Asn Tyr Gly Phe Tyr Pro Val Tyr. 

5 10 



20 (51) INFORMATION FOR SEQ. ID NO: 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

0 

(iv) FRAGMENT TYPE: N-terminal . 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 50 

5 Phe Leu Val Glu Asn Pro Glu Tyr Val Glu Lys 

5 10 



(52) INFORMATION FOR SEQ ID NO: 51 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 51 

Xaa lie Thr Pro His Leu Thr Ala Xaa Ala Ala 

5 io 
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(53) INFORMATION FOR SEQ ID NO: 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv)- FRAGMENT- -TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 52 

15 

Thr Leu Met Glu Asp Tyr Gin Ser Ser Lys 

5 ' 10 



20 (54) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 
30 

(iv) FRAGMENT TYPE: N- terminal 

(v) - SEQUENCE DESCRIPTION FOR SEQ ID NO: 53 

35 lie Leu Glu Gly Leu Thr Gly Ala Asp Phe Val Gly Phe Gin Thr 

5 10 15 



(55) INFORMATION FOR SEQ ID NO: 54 
40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N- terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 54 

Gin lie Leu Xaa Pro Thr Leu Xaa Tyr Gin lie Pro Asp Asn 
55 5 10 
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(56) INFORMATION FOR SEQ ID NO: 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

_ . ( iv) _ FRAGMENT TYPE : N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 55 

15 

Phe Gly Gly Tyr Set Asn Lys 

5 



20 (57) INFORMATION FOR SEQ ID NO: 56 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 23 amino acids 

(B) TYPE: Amino acid 
25. (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 56 

35 Phe Xaa Thr Glu Asn Ala Glu Asp Gin Asp Xaa Val Ala Xaa Val 

5 10 15 

lie Gly Xaa Ala lie Xaa Xaa lie 

20 

40 

(58) INFORMATION FOR SEQ ID NO: 57 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 18 amino acids 
45 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
50 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
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(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 57 

Xaa Val Gly Thr Val Gly lie Pro Thr Asp Glu lie Pro Glu Asn 

5 10 15 

He Leu Ala 



(59) INFORMATION FOR SEQ ID NO: 58 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A)-LENGTH-l 9_ amino- -acids - - - - _ 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

15 

(ii) MOLECULAR TYPE: Peptides 

(iii) HYPOTHETICAL: No 

20 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 58 

Leu Leu Val His Ser Leu Leu Asn Asn Thr Ser Gin Thr' Ser Leu 
25 5 10 15 

Glu Gly Pro Asn 



30 (60) INFORMATION FOR SEQ ID NO: 59 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 20 amino acids 

(B) TYPE: Amino acid 
35 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

40 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 59 

45 Ser Ser Thr Thr Asn Thr Ala Thr Leu Xaa Xaa Leu Val Ser Ser 

5 10 15 

Xaa He Phe Met Glu 

20 

50 



) 
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(61) INFORMATION FOR SEQ ID NO: 60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 
5 (B) TYPE: Amino acid 

(C) topology : Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT. TYPE: _N-terminal . . . __ 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 60 

15 

Ala Xaa Asn Arg Pro Thr Ser Ala Ala Thr Ser Leu Val Asn Arg 

5 io 15 



20 (62) INFORMATION FOR SEQ ID NO: 61 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

0 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 61 

5 Xaa Phe Thr lie lie Xaa 

5 



(63) INFORMATION FOR SEQ ID NO: 62 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 62 

Asn Leu Thr Ala Asn Ala Thr Thr Ser His Thr Pro Thr Ser Lys 

5 10 15 
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(64) INFORMATION FOR SEQ ID NO: 63 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

... ( iv) . FRAGMENT. TYPE : N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 63 

15 

Phe Xaa Xaa Tyr Ser Asn Lys 

5 



20 (65) INFORMATION FOR SEQ ID NO: 64 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 64 

35 Xaa Pro Xaa Ala Phe Asn Xaa 

5 



(66) INFORMATION FOR SEQ ID NO: 65 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 65 

lie Ala Ser Pro lie Gin Xaa Glu 
55 5 
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(67) INFORMATION FOR SEQ ID NO: 66 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT JTYPE : N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 66 

15 

Gin Arg Pro Leu Leu Ala Lys 

5 



20 (68) INFORMATION FOR SEQ ID NO: 67 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
• (iii) HYPOTHETICAL: No 

0 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 67 

5 Phe Phe Ser Pro Ser Ser Asn lie Pro Thr Asp Arg 

5 10 



(69) INFORMATION FOR SEQ ID NO: 68 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 68 

Ala Leu Ser Asn Asn lie Ser Gin Glu 

5 
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(70) INFORMATION FOR SEQ ID NO: 69 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) .FRAGMENT ..TYPE.:... N-terminal . _ . . 
(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 69 



15 



40 



Xaa Xaa Xaa Tyr Thr Pro 

5 



20 (71) INFORMATION FOR SEQ ID NO: 70 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 16 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 70 

35 lie Ala Ser Pro lie Gin Gin Gin Gin Gin Asp Pro Thr Ala Asn 

5 10 15 

Leu 



(72) INFORMATION FOR SEQ ID NO: 71 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 
45 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
50' (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
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(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 71 

Thr Met Leu Lys Pro Arg 

5 



(73) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 72 

lie lie Glu Asp Glu Ala 

5 



(74) INFORMATION FOR SEQ ID NO: 73 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 73 

lie Thr Pro His Leu Thr Ala Ser Ala Ala Lys 

5 10 



(75) INFORMATION FOR SEQ ID NO: 74 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 



(iv) FRAGMENT TYPE: N-terminal 
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(V) SEQUENCE DESCRIPTION FOR SEQ ID 110:74 

Ser Leu Val Ala Pro Ala Pro Glu 

5 

5 

(76) INFORMATION FOR SEQ ID NO:75 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH 12 amino acids 
(B) TYPE: Amino acid 
_ XC). TOPOLOGY: Linear... 



15 



(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N- terminal 

20 (v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 75 

Lys Pro Gin Asp Leu Asp Asp Asp Pro Leu Tyr Leu 

5 10 



25 



40 



(77) INFORMATION FOR SEQ ID NO: 76 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 
30 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
35 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 7 6 



Lys Tyr Ala Leu Leu Arg 

5 



45 (78) INFORMATION FOR SEQ ID NO: 77 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
50 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

55 

(iv) FRAGMENT TYPE: N-terminal 
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(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 77 

Gin Leu Gly Asn Tyr Xaa Phe Tyr Pro Val Tyr 

5 io 



(79) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 



15 



(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

20 (v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 78 



25 



Ala Phe Glu Asp His Ser Trp Lys 

5 



(80) INFORMATION FOR SEQ ID NO: 79 



(i) SEQUENCE CHARACTERISTICS: 

0 (A) LENGTH 16 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

5 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

0 (V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 79 

Ala Gly His Ala lie Val Tyr Gly Asp Ala Thr Ser Thr Tyr Ala 

5 10 15 

Lys 

5 



(81) INFORMATION FOR SEQ ID NO: 80 

0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

5 (ii) MOLECULAR TYPE: Peptide 
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(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

5 (V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 80 

Glu Arg Leu Pro Gly Ser Tyr Tyr Lys 

5 

10 

(82) INFORMATION FOR SEQ ID NO: 81 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 
15 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
20 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 81 

25 

Thr Leu Met Glu Asp Tyr Gin 

5 



30 

(83) INFORMATION FOR SEQ ID NO: 82 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 1098 amino acids 
35 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Polypeptide 
40 (iii) HYPOTHETICAL : Yes 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 82 



45 Met Ala Leu lie Val Ala Ser Leu Phe Leu Pro Tyr Gin Pro Gin 

5 10 15 

Phe Glu Leu Asp Thr Ser Leu Pro Glu Asn Ser Gin Val Asp Ser 

20 25 30 

Ser Leu Val Asn lie Gin Ala Met Ala Asn Asp Gin Gin Gin Gin 
50 35 40 45 

Arg Ala Leu Ser Asn Asn lie Ser Gin Glu Ser Leu Val Ala Pro 

50 55 60 

Ala Pro Glu Gin Gly Val Pro Pro Ala lie Ser Arg Ser Ala Thr 
65 70 75 

55 Arg Ser Pro Ser Ala Phe Asn Arg Ala Ser Ser Thr Thr Asn Thr 

80 85 90 
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Ala Thr Leu Asp Asp Leu Val Ser Ser Asp lie Phe Met Glu Asn 

95 100 105 

Leu Thr Ala Asn Ala Thr Thr Ser His Thr Pro Thr Ser Lys Thr 

110 • 115 120 

5 Met Leu Lys Pro Arg Lys Asn Gly Ser Val Glu Arg Phe Phe Ser 

125 130 135 

Pro Ser Ser Asn lie Pro Thr Asp Arg lie Ala Ser Pro He Gin 

140 145 150 

His Glu His Asp Ser Gly Ser Arg lie Ala Ser Pro He Gin Gin 
10 155 160 165 

Gin Gin Gin Asp Pro Thr Thr Asn Leu Leu Lys Asn Val Asn Lys 

- 170 175 . _ _ 180 

Ser Leu Leu Val His Ser Leu Leu Asn Asn Thr Ser Gin Thr Ser 

185 190 195 

15 Leu Glu Gly Pro Asn Asn His lie Val Thr Pro Lys Ser Arg Ala 

200 205 210 

Gly Asn Arg Pro Thr Ser Ala Ala Thr Ser Leu Val Asn Arg Thr 

215 220 225 

Lys Gin Gly Ser Ala Ser Ser Gly Ser Ser Gly Ser Ser Ala Pro 
20 230 235 240 

• Pro Ser He Lys Arg He Thr Pro' His Leu Thr Ala Ser Ala Ala 

245 250 255 

Lys Gin Arg Pro Leu Leu Ala Lys Gin Pro Ser Asn Leu Lys Tyr 

260 265 270 

25. Ser Glu Leu Ala Asp He Ser Ser Ser Glu Thr Ser Ser Gin His* 

275 280 285 

Asn Glu Ser Asp Pro Asp Asp Leu Thr Thr Ala Pro Asp Glu Glu 

290 295 300 

Tyr Val Ser Asp Leu Glu Met Asp Asp Ala Lys Gin Asp Tyr Lys 
30 305 310 315 

Val Pro Lys Phe Gly Gly Tyr Ser Asn Lys Ser Lys Leu Lys Lys 

320 325 330 

Tyr Ala Leu Leu Arg Ser Ser Gin Glu Leu Phe Ser Arg Leu Pro 

335 340 345 

35 Trp Ser He Val Pro Ser He Lys Gly Asn Gly Ala Met Lys Asn 

350 355 360 

Ala He Asn Thr Ala Val Leu Glu Asn He He Pro His Arg His 

365 370 . 375 

Val Lys Trp Val Gly Thr Val Gly He Pro Thr Asp Glu He Pro 
40 380 385 390 

Glu Asn He Leu Ala Asn He Ser Asp Ser Leu Lys Asp Lys Tyr 

395 400 405 

Asp Ser Tyr Pro Val Leu Thr Asp Asp Asp Thr Phe Lys Ala Ala 

410 415 420 

45 Tyr Lys Asn Tyr Cys Lys Gin He Leu Trp Pro Thr Leu His Tyr 

425 430 435 

Gin He Pro Asp Asn Pro Asn Ser Lys Ala Phe Glu Asp His Ser 

440 445 450 

Trp Lys Phe Tyr Arg Asn Leu Asn Gin Arg Phe Ala Asp Ala He 
50 455 460 465 

Val Lys He Tyr Lys Lys Gly Asp Thr He Trp He His Asp Tyr 

470 475 480 

His Leu Met Leu Val Pro Gin Met Val Arg Asp Val Leu Pro Phe 

485 490 495 

55 Ala Lys He Gly Phe Thr Leu His Val Ser Phe Pro Ser Ser Glu 

500 505 510 
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Val Phe Arg Cys Leu Ala Gin Arg Glu Lys lie Leu Glu Gly Leu 
} 515 520 525 

Thr Gly Ala Asp Phe Val Gly Phe Gin Thr Arg Glu Tyr Ala Arg 1 

530 535 540 

5 His Phe Leu Gin Thr Ser Asn Arg Leu Leu Met Ala Asp Val Val 

545 550 555 

His Asp Glu Glu Leu Lys Tyr Asn Gly Arg Val Val Ser Val Arg 

560 565 570 
Phe Thr Pro Val Gly lie Asp Ala Phe Asp Leu Gin Ser Gin Leu 
10 575 580 585 
Lys Asp Gly Ser Val Met Gin Trp Arg Gin Leu lie Arg Glu Arg 
590 595 goo 

Trp Gin Gly Lys Lys Leu lie Val Cys Arg Asp Gin Phe Asp Arg 

605 610 615 

15 lie Arg Gly lie His Lys Lys Leu Leu Ala Tyr Glu Lys Phe Leu 

620 625 630 

Val Glu Asn Pro Glu Tyr Val Glu Lys Ser Thr Leu lie Gin lie 

635 640 645 

Cys lie Gly Ser Ser Lys Asp Val Glu Leu Glu Arg Gin lie Met 
20 650 655 660 

lie Val Val Asp Arg lie Asn Ser Leu Ser Thr Asn lie Ser lie 

665 670 675 

Ser Gin Pro' Val Val Phe Leu His Gin Asp Leu Asp Phe Ser Gin 

680 685 690 

25 Tyr Leu Ala Leu Ser Ser Glu Ala Asp Leu Phe Val Val Ser Ser 

695 700 705 

Leu. Arg Glu Gly Met Asn Leu Thr Cys His Glu Phe He Val Cys 

710 < 715 720* 

Ser Glu Asp Lys Asn Ala Pro Leu Leu Leu Ser Glu Phe Thr Gly 
30 725 730 735 

Ser Ala Ser Leu Leu Asn Asp Gly Ala He He He Asn Pro Trp 

740 745 750 

Asp Thr Lys Asn Phe Ser Gin Ala He Leu Lys Gly Leu Glu Met 

755 760 765 

35 Pro Phe Asp Lys Arg Arg Pro Gin Trp Lys Lys Leu Met Lys Asp 

. 770 775 780 

He He Asn Asn Asp Ser Thr Asn Trp He Lys Thr Ser Leu Gin 

785 790 795 

Asp He His He Ser Trp Gin Phe Asn Gin Glu Gly Ser Lys He 
40 800 N805 810 

Phe Lys Leu Asn Thr Lys Thr Leu Met Glu Asp Tyr Gin Ser Ser 

815 820 825 

Lys Lys Arg Met Phe Val Phe Asn He Ala Glu Pro Pro Ser Ser 

830 835 840 

45 Arg Met He Ser He Leu Asn Asp Met Thr Ser Lys Gly Asn He 

845 850 855 

Val Tyr He Met Asn Ser Phe Pro Lys Pro He Leu Glu Asn Leu 

860 865 870 

Tyr Ser Arg Val Gin Asn He Gly Leu He Ala Glu Asn Gly Ala 
50 875 880 885 

Tyr Val Ser Leu Asn Gly Val Trp Tyr Asn He Val Asp Gin Val 

890 895 900 

Asp Trp Arg Asn Asp Val Ala Lys He Leu Glu Asp Lys Val Glu 

905 910 915 

55 Arg Leu Pro Gly Ser Tyr Tyr Lys He Asn Glu Ser Met He Lys 

920 925 930 
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(84) INFORMATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS 
30 (A) LENGTH 5981 basepairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Doublestranded 
• (D) TOPOLOGY : Linear 

35 (ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

40 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomvces cerevisiae 

(B) STRAIN: S288C 

(C) HAPLOTYPE: Haploid 

45 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Genomic 

(B) CLONES: 6 and 10 

(viii) POSITION IN GENOME 
(A) CHROMOSOME: 13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
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GGCTCACATT CCAAAAAGAA CAGTTCGAAC GATAAAGCTT TTCACCGCTG 50 
GTGGTGATAA CAAGGTTAGG ATATGGAAGT TGAACAGAGA TGAAAACGGA 100 
CAAAATGGGG GGGTGCGTAA GATTGAAAGC CTTGACTTTC TTGGCTCGTG 150 
ACGCATCACG AACAGGCCAT AAATGTAATC CGATTCAACT CGAAGGGTGA 200 
5 CGTACTGGCG TCTGCGGGCG ATGACGGCCA AGTGCTGCTA TGGAAGCAAG 250 
AAGAACCAAA TACACAGCAA GAATCTGTGG TCAGACCATT CGGAATGGAT 3 00 
GCGGAGACTA GTGAAGCAGA CGAGAACAAG GAGAAATGGG TTGTGTGGAA 350 
ACGGCTGCGT GGTGGTAGCG GTGCTACTGC GGCGGCAGAG ATTTACGATC 400 
TAGCGTGGTC ACCTGATAAC AGGAACATAG TGGTGGCATG TATGGACAAT 450 
10 TCGATACGAC TGTTCGATGT TGGAGCTGGG ATGCTGGTAT GCGGCCAGTC 500 
GGATCATGGT CACTACGTCC AAGGTCTTGC ATGGGACCCA TTAAATCAGT 550 
TTATTCTCTC ACAGTCTGCG GACCGGTCTC TGCATGTATA TGGAGTCATT 600 
CTTTCATCTG CAGGAGTTAG TTACAGGGCT TGAAACTTTA GAAGTAAGGT 650 
TGCCAAGGCA GAACTGCCTT GTCCAGGTGA TGTCCTGAGG ACAAATTACT 700 
15 TTTTCACAAC GAGACGCTAC CTTCATTCTT TAGGCGATGC AGCATATCGC 750 
CTTGTGGTGG TTTGGTCGTA ATTCCCAGTG GTGTGTATAA GGTGGCTGGT 800 
GATGAAGTCG CGAACTGCGT ATACGTGTAT ACTAGATCTG GAATACTGAA 850 
CAGCGCTGGT GGCGTTAAAA ACCGGCCTGC GATTAGAATC CCATCTTTGA 900 
AGAAACCAGC GCTGATGGCG GCTTTCTCGC CCGTATTTTA CGAAACGTGC 950 
20 CAGAAGAGTG TGCTTAAGCT GCCCTATAAG CTAGTATTTG CCATAGCAAC 1000 
GACTAACGAA GTACTCGTGT ACGACACGGA TGTATTGGAG CCGTTATGCG 1050 
TTGTGGGAAA TATACATTAC TCACCCATAA CTGATTTAGC ATGGTCTGAG 1100 
GATGGCTCGA CCCTACTAAT CTCATCAACA GACGGATTCT GTTCGTATGT 1150 
ATCGATCGAC ACAGAAACGC AATTCGGTTC AAGGATAGAG CCGCCAGCGA 1200 
25 TGCATGCAGA GCCACTAGAC ACTGACGAGA GCGCGGTAGC GGCTAAGAAC 1250 
CAGCGCGAGG CAGGTGGGAT CGTGAACATG CTGCCGGTGA AGAAGATCCC 1300 
CTGCAATAGT AGCGATAGTA AAAAGAGGCG CATACATCCT ACGCCAGTCG 1350 
ATTTGTGATT TTTAATATAT TTAATGCGGT ACATAAGAAT GCCTAATCTA 14 00 
GTTTGCCAGC GAAGATATTT TCCATTGTGT GCTCAATGGA CCCTGTGTTC 1450 
30 CTGAGATCTT GCACGACTTT TTCCTGAGGC ACATGTGGCG CCATCGTCAT 1500 
CACGATTTCA ACCATGTCAC TATTAACGGC TCTCTTTCGA TCACTACCAT 1550 
GTCTGTTTAA CCGAGCAACG CGTTCCTCCG GAGCCGATGG TACTGGCTCC 1600 
GGAGAAGGGT CGTTGGTGGC GACCGAGGGC GCCGGTTTGG CATCCTGTAC 1650 
GGTCCGCAAG GGTACTTGCT TGGCGCCCCT GTGTTTCACG GTGTAAACAA 1700 
35 ACAAGCACAC CATCGTCAGT ATAAAGCACT ATAGTCGAAC CATCCATTTT 1750 
TACTTTTGTG CGCGTGGGTA GCCGTGCCTC GTCTGTGTGT GTGGGAATGT 1800 
ATAAATGTGT CCCGAGTTAT TATTCTAAAG CGGGCACCAT TGTAGTAACT 1850 
TATTGCGAAA TTTCTGCTCT TCTCGTCTCG CTCAAAAATC GCGTTCAGGG 1900 
TAAAAGGGGC GAAACAGAGG GCCAGATAGA AATTTCGAGA AAACGGGTCA 1950 
40 CCCCGCCCCT GCATTTTGAT ATGGCGTATT TGGGATTGCT TGCTCGAAAG 2000 
TGTCTAAGTC CGGCTGGCGG GCCTGGCGCC CTCGCCGAAG GGAGATAGGA 2050 
AGGGGCGGGG GTCCGGGCAG CGGCTATGGT GTCAGTTACC TAGGGAAGGA 2100 
GAAGGGGGTA GAACCAAGGG GCTAGCACAC TCACCCTGGG GCCCCCGTCT 2150 
AGCCAAGCTT AAATATAAAT ACTAATGTAA CTATAAATAT AAGGATCTAC 2200 
45 CGTGTCATTG CACATCCACC CACCCGTCGA TTAAAAAACC AAACAAAGCA 2250 
AAGAATACAA TAGCAACGCA AGATCAACAC AATGGCTCTC ATCGTGGCAT 2300 
CTTTGTTTTT GCCCTACCAA CCACAATTCG AGCTTGACAC CTCTCTCCCT 2350 
GAGAACTCGC AGGTGGACTC ATCTCTCGTG AACATCCAGG CTATGGCCAA 2400 
TGACCAACAG CAACAACGTG CGCTTTCTAA CAACATCTCA CAGGAATCAT 2450 
50 TGGTCGCGCC AGCACCAGAA CAAGGTGTCC CCCCAGCAAT CTCAAGGAGT 2500 
GCCACCAGGT CACCCAGTGC TTTCAACCGC GCCTCGTCTA CGACAAATAC 2550 
TGCCACTTTA GATGATCTTG TCTCTTCGGA CATATTCATG GAAAACTTGA 2 600 
CTGCGAATGC AACTACCTCA CATACGCCAA CAAGCAAGAC TATGCTTAAA 2650 
CCCCGGAAAA ATGGTTCCGT GGAACGATTC TTCTCCCCTT CTTCCAATAT 2700 
55 TCCCACGGAT CGCATCGCAT CGCCAATCCA GCATGAGCAT GACTCCGGTT 2750 
CGAGAATTGC TTCGCCAATC CAACAGCAAC AGCAGGACCC CACGACCAAC 2800 



WO 93/17093 



PCT/FI93/<i0a<9 



119 



TTATTAAAGA ACGTCAACAA GTCATTGTTA GTGCACTCAC TGTTGAACAA 2850 
CACCTCACAA ACTAGCCTAG AAGGACCCAA CAACCACATT GTTACCCCGA 2900 
AATCGAGGGC GGGCAACAGG CCTACTTCGG CGGCTACTTC TTTAGTTAAT 2 950 
AGGACCAAAC AAGGTTCGGC CTCCTCTGGA TCTTCTGGGT CTTCTGCGCC 3 000 
5 ACCTTCCATT AAAAGGATTA CGCCCCACTT GACTGCGTCT GCTGCAAAAC 3050 
AGCGTCCCTT ATTGGCTAAA CAGCCTTCTA ATCTGAAATA TTCGGAGTTA 3100 
GCAGATATTT CGTCGAGTGA GACGTCTTCG CAGCATAATG AGTCGGACCC 3150 
GGATGATCTA ACTACTGCCC CTGACGAGGA ATATGTTTCT GATTTGGAAA 3 200 
TGGATGACGC GAAGCAGGAC TACAAGGTTC CAAAGTTCGG CGGCTATTCC 3250 
10 AATAAATCTA AACTTAAGAA ATATGCGCTG TTAAGGTCAT CTCAGGAGCT 3300 
GTTTAGCCGT CTTCCATGGT CGATCGTTCC CTCTATCAAA GGTAATGGCG 3350 
CCATGAAGAA CGCCATAAAC ACTGCAGTCT . TGGAGAATAT- CATTCCGCAG 3400 
CGTCATGTTA AGTGGGTCGG TACCGTCGGA ATCCCAACGG ATGAGATTCC 3450 
GGAAAATATC CTTGCGAACA TCTCTGACTC TTTAAAAGAC AAGTACGACT 3500 
15 CCTATCCTGT CCTTACGGAC GACGACACCT TCAAAGCCGC ATACAAAAAC 3550 
TACTGTAAAC AAATCTTGTG GCCTACGCTG CATTACCAGA TTCCAGACAA 3600 
TCCGAACTCG AAGGCTTTTG AAGATCACTC TTGGAAGTTC TATAGAAACT 3650 
TAAACCAAAG GTTTGCGGAC GCGATCGTTA AAATCTATAA GAAAGGTGAC 3700 
ACCATCTGGA TTCATGATTA CCATTTAATG CTGGTTCCGC AGATGGTGAG 3750 
20 AGACGTCTTG CCTTTTGCCA AAATAGGATT TACCTTACAT GTCTCGTTCC 38 00 
CCAGTAGTGA AGTGTTTAGG TGTCTGGCTC AGCGTGAGAA GATCTTAGAA 3850 
GGCTTGACCG GTGCAGACTT TGTCGGCTTC CAGACGAGGG AGTATGCAAG 390O 
ACATTTCTTA CAGACGTCTA ACCGTCTGCT AATGGCGGAC GTGGTACATG 3950 
ATGAAGAGCT AAAGTATAAC GGCAGAGTCG TTTCTGTGAG GTTCACCCCA 4O0O 
25 GTTGGTATCG ACGCCTTTGA TTTGCAATCG CAATTGAAGG ATGGAAGTGT 405O 
CATGCAATGG CGTCAATTGA TTCGTGAAAG ATGGCAAGGG AAAAAACTAA 4100 
TTGTGTGTCG TGATCAATTC GATAGAATTA GAGGTATTCA CAAGAAATTG 4150 
TTGGCTTATG AAAAATTCTT GGTCGAAAAT CCGGAATACG TGGAAAAATC 4200 
GACTTTAATT CAAATCTGTA TTGGAAGCAG TAAGGATGTA GAACTGGAGC 4250 
30 GCCAGATCAT GATTGTCGTG GATAGAATCA ACTCGCTATC CACCAATATT 43 00 
AGTATTTCTC AACCTGTGGT GTTTTTGCAT CAAGATCTAG ATTTTTCTCA 4350 
GTATTTAGCT TTGAGTTCAG AGGCAGATTT GTTCGTAGTC AGCTCTCTAA 44 00 
GGGAAGGTAT GAACTTGACA TGTCACGAAT TTATCGTTTG TTCTGAGGAC 4450 
AAAAATGCTC CCCTACTGTT GTCAGAATTT ACTGGTAGTG CATCTTTATT 4500 
35 GAATGATGGC GCTATAATAA TTAACCCATG GGATACCAAG AACTTCTCAC 4550 
AAGCCATTCT CAAGGGGTTG GAGATGCCAT TCGATAAGAG AAGGCCACAG 4600 
TGGAAGAAAT TGATGAAAGA CATTATCAAC AACGACTCTA CAAACTGGAT 4650 
CAAGACTTCT TTACAAGATA TTCATATTTC GTGGCAATTC AATCAAGAAG 4700 
GTTCCAAGAT CTTCAAATTG AATACAAAAA CACTGATGGA AGATTACCAG 4750 
40 TCATCTAAAA AGCGTATGTT TGTTTTCAAC ATTGCTGAAC CACCTTCATC 48 00 
GAGAATGATT TCCATACTGA ATGACATGAC TTCTAAGGGC AATATCGTTT 4850 
ACATCATGAA CTCATTTCCA AAGCCCATTC TGGAAAATCT TTACAGTCGT 4900 
GTGCAAAACA TTGGGTTGAT TGCCGAGAAT GGTGCATACG TTAGTCTGAA 4950 
CGGTGTATGG TACAACATTG TTGATCAAGT CGATTGGCGT AACGATGTAG 5000 
45 CCAAAATTCT CGAGGACAAA GTGGAGAGAT TACCTGGCTC GTACTACAAG 5050 
ATAAATGAGT CCATGATCAA GTTCCACACT GAAAATGCGG AAGATCAAGA 5100 
TCGTGTAGCT AGTGTTATCG GTGATGCCAT CACACATATC AATACTGTTT 5150 
TTGACCACAG AGGTATTCAT GCCTACGTTT ACAAAAACGT TGTTTCCGTA 52 00 
CAACAAGTGG GACTTTCCTT ATCGGCAGCT CAATTTCTTT TCAGATTCTA 5250 
50 TAATTCTGCT TCGGATCCAC TGGATACGAG TTCCGGCCAA ATCACAAATA 53 00 
TTCAGACACC ATCTCAACAA AATCCTTCAG ATCAAGAACA ACAACCTCCA 5350 
GCCTCTCCCA CTGTGTCGAT GAACCATATT GATTTCGCAT GTGTCTCTGG 54 00 
TTCATCGTCT CCTGTGCTTG AACCATTGTT CAAATTGGTC AATGATGAAG 5450 
CAAGTGAAGG GCAAGTAAAA GCCGGACACG CCATTGTTTA TGGTGATGCT 5500 
55 ACTTCTACTT ATGCCAAAGA ACATGTAAAT GGGTTAAACG AACTTTTCAC 5550 
GATCATTTCA AGAATCATTG AAGATTAAAT TTTACCATTT TAAAATTTTA 5600 
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ATGTTCTTGG GTATGAACTT TTATTTTCAA CTGCTTATTA TATATCAATT 5650 
CTATAAATTT TTTTCTTCTC TCTAACGACC AATTATAAAA TTCATCCTCT 5700 
TATTTATTAC AGCATCTTAT ACATTATGTA TATGGGTAGC TATTATTCAT 5750 
TTTTGCTTCG TAAGGACTTT TTTTGTCAAC TTTTTCATCC TAAGCGGCTA 5800 
5 AAAGTGATTG GAGAGGAATG TCCAGGCGAC CAATGATAAA AACGCTTTCT 5850 
CTTGGAACAA GAAATAGGAG CAATTGACAG TTGTCGATGA ACAGCGAAAA 5900 
TAGTAAGATA ACCTTCAAGC CCAATATTCT AATTAAAGGC GTTTATATAT 5950 
TTGTACTTTA TGGTATGTGC ATATGTATTG T 5981 



(85) INFORMATION" FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH50 base pairs 
15 (B) TYPE: Nucleotide 

(C) STRANDEDNESS : Doublestranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:84 



CGGGAAGACA TAGAACTATG ACTACGGATA ACGCTAAGGC GCAACTGACC 50 



(86) INFORMATION FOR SEQ ID NO: 85 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH48 basepairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: Doublestranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:85 



GGGCCCAACA ACACAATGGT TACCCCGAAA TCGAGGGCGG GCAACAGG 48 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuJc I3bis) 



A. Tbe indications made below relate to tbe microorganism referred to in the description 
on page 25 Jine 16 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet j^j 



Name of depositary institution 
DSM-DEUTSCHE SAMMLUNG VON MIKROORGANISMEIN UND ZELLKULTUREN GmbH 



| Address of depositary institution (including postal code and country) 

Mascheroder Weg 1 B, D-3300 Braunschweig, Federal Republic of Germany 



Date of deposit 



18 February 1992 



Accession Number 



DSM 6928 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) ITjis information is continued on an additional sheet Q 



In respect of those designations in which a European patent is sought, 
a sample of the deposited microorganism will be made available only by 
the issue of such a sairple to an expert nominated by the person requesting 
the sairple (Rule 28(4) EPC) until the publication of the mention of "the 
. grant of the European patent or until the date on which the application 
has been refused or is deemed to be withdrawn. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications arc not for all designated States* 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



'Accession 



For receiving Office use only 



IT] "Hris sheet was received with tbe international application 



AutbenzRftjfEcer 



F rm PCmo/134 (July 1992) 



For International Bureau use only 



n Tnis sheet was received by the International Bureau 



on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuleLJto) 



A. The indications made beiow relate to the microorganism referred to in trie description 
25 .line 28 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 



Address of depositary institution {including postal code and caumry) 

Mascheroder Weg 1 B, D-330Q Braunschweig, Federal Republic of Germany 



Date of deposit 

25 January 1993 


Accession Number 

DEM 7425 - 


C. ADDITIONAL INDICATIONS (leave bknkif not applicable) This iniormauon is continued on an addttionai sheet [ | 



In respect of those designations in which a European patent is sought, 
a sample of the deposited microorganism will be made available only by 
the issue of such a sample to an expert nominated by the person requesting 
the sanple (Rule 28(4) EPC) until the publication of the mention of the 
grant of the European patent or until the date on which the application 
has been refused or is deemed to be withdrawn. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for aii daipiated Stoics) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications iistedbeiow will besuommed to the international Bureau i&icf{spcctfvuici&neratnaiurccftneindu^ 'Accession 
Numver of Deposit') 



For receiving Office use only 



l^Q ^b* 5 snect was received with the interna tionai application 




Far Internationa] Bureau use oniv 



n This sheet was received by th International Bureau < 



Autbonzed ofticer 



WO 93/1/0*) 
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CLAIMS: 

1. Genes, which comprise at least one of the isolated and ' 
purified DNA sequences coding for the short chain of about 57 

5 JcDa and the long chains of about 99 kDa and about 123 kDa of 
trehalose synthase and functional equivalents thereof. 

2. The genes of claim 1, wherein the isolated and purified DNA 
sequence encodes the "amino - acid sequences T of the~shbrt chain ^>f 

10 about 57 kDa having the SEQ ID NO: 2 or functional equivalents 
thereof . 

3. The genes of claim 1 wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 

15 about 123 kDa comprising the SEQ ID NO: 4 or functional 
equivalents thereof, 

4. The genes of claim 1 wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 

20 about 123 kDa comprising the SEQ ID NO: 82 or functional 
equivalents thereof. 

5. The genes of claim 1 wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 

25 about 99 kDa having the SEQ ID NO: 29 - 38 and 44 - 49 or 
functional equivalents thereof. 

6. The genes of claim 1, which are selected from the group 
consisting of TSS1, TSLl and TSL2 and functional equivalents 

30 thereof. 

7. The genes of claim 6, wherein TSS1 comprises the open 
reading frame of SEQ ID NO: 1 or functional equivalents 
thereof. 

35 

8. The genes of claim 6, wherein TSLl comprises the open 
reading frame of SEQ ID NO: 3 or functional equivalents thereof. 



WO 93/17093 



124 

9. The g nes of claim 6, wherein TSL1 comprises the open 
reading frame SEQ ID NO: 83 or functional equivalents thereof. 

10. The genes of claim 6, wherein TSL2 comprises the DNA 
sequences inserted into pALK756 (DSM7425) or functional 
equivalents thereof. 

11. A truncated TSL1 gene encoding a truncated form of the 12 
kDaT Xong"bfia"in of trehalose synthase lacking up to i 600* amino 
acids from one end preferably the N-terminus. 

12. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks up to 33 0 amino acids from the N-terminus end. 

13. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 250 - 450 amino acids from the N-terminus end. 

14. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 250 - 450, preferably about 325 amino acids from 
the N-terminus end. 

15. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 209 amino acids from the N-terminus end. 

16. A vector which comprises at least one of the genes of 
claims 1-15. 

17. A DNA construct which comprises at least one of the genes 
of claims 1-15. 

18. Host cells or organisms transformed with at least one of 
the vectors of claim 16. 

19. Host cells or organisms transformed with at least one of 
DNA constructs of claim 17. 

20. Host cells or organisms transformed with the truncated 
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TSL1 gene of claims 10 - 15. 

21. The transformed host cells or organisms of claims 18 or 
19, which express a trehalose synthase which exhibits 
trehalose-6-phosphate synthase activity activatable by 
fructose-6-phosphate and also trehalose-6-phosphatase activity. 

22. The transformed host cells or organisms of claim 20, which 
express a trehalose synthase that is less inhibited by 
phosphate than is the intact trehalose synthase. 

23. The transformed host cells or organisms of any of claims 
18 to 22, which are selected from a group consisting of plants, 
fungi, yeasts or bacteria. 

24. The transformed host cells or organisms of claims 23, 
wherein the yeast is Saccharomvces cerevisiae . 

25. The transformed host cells or organisms of any of claims 18 
to 22, wherein parts of the organism have increased trehalose 
content as compared to the corresponding parts of the parent 
cell or organism when grown under the same conditions. 

26. The transformed host cells or organisms of any of claims 18 
to 22, wherein said cells or organisms produce a higher yield 
of ethanol from carbohydrate than the parent cell or organism. 

27. The transformed host cells or organisms of any of claims 
claims 18 to 22, wherein said cells or organisms are more 
resistant to heat, cold and water deprivation than are the 
parent cell or organism. 

28. A trehalose synthase produced by recombinant DNA techniques 
using at least one of the genes of claims 1 to 15. 

29. A substantially purified trehalose synthase, which exhibits 
trehalose-6 -phosphate synthase activity activatable by 
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fructose-6-phosphate and also trehalose-6-phosphatase activity. 

30. A substantially purified trehalose synthase, which exhibits 
trehalose-synthase activity less sensitive to inhibition by 
phosphate. 

31. The trehalose synthase of claims 28 to 30, which comprises 
one short chain of about 57 kDa and at least one of two long 
chains' of about 99' JcDa and 123 kDa or truncated forms of the 
123 kDa chain. 

32. The trehalose synthase of claim 3l f wherein the short 
chain comprises the amino acid sequence of SEQ ID NO: 2 or 
functional equivalents thereof. 

33. The trehalose synthase of claim 31, wherein the 123 kDa 
long chain comprises the amino acid sequence of SEQ ID NO: 4 or 
functional equivalents thereof. 

34. The trehalose synthase of claim 31, wherein the 123 kDa 
long chain comprises the amino acid sequence of SEQ ID NO: 82 
or functional equivalents thereof. 

35. The trehalose synthase of claim 31, wherein the 99 kDa long 
chain comprises a majority, preferably at least ten of the 
amino acid sequences SEQ ID N0:s 29 to 38 and 44 to 49 or 
functional equivalents thereof. 

36. The trehalose synthase of claim 31, wherein the 99 kDa long 
chain comprises the amino acid sequences encoded by the DNA 
sequence inserted into pALK756 (DSM7425) or functional 
equivalents thereof. 

37. A trehalose-6-phosphate synthase, which comprises the 57 
kDa polypeptide corresponding to the short chain of claim 31 
and having the amino acid sequence of SEQ ID. NO: 2 or 
functional equivalents thereof. 
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38. A trehalose-6-phosphate phosphatase, which comprises an 
about 99 kDa polypeptide corresponding to the long chain of 
claims 33 - 34. 

39. A process for producing ethanol by using the host cells or 
organisms of claims 22 or 23, wherein the yield of ethanol or 
its rate of production is greater than by using the 
untransformed cell or organism. 

40. The process of claim 39, wherein the organism is 
distiller's yeast. 



41. The process of claim 39, wherein at least one of the DNA 
sequences of claims 1-15 are functionally combined with 

15 promoters active under fermentative conditions. 

42. A process for producing a crop plant which has increased 
resistance to water deprivation, heat and cold, comprising 
transforming the plant by introducing at least one of the genes 
of claims 1-15 into said plant's tissue. 
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43. A process for producing trehalose by cultivating a host or 
organism which has been transformed with at least one of the 
genes of claims 1-15. 

44. The process according to claim 43, wherein the transformed 
host cell or organism is a fungus, yeast or plant. 

45. The process according to claim 44, wherein the yeast is 
30 Saccharomvces cerevisiae . 

46. A process for producing trehalose enriched food products 
from plants by introducing at least one of the genes of claims 
1-15 and allowing said genes to express the trehalose 

35 synthase in the edible tissues of the plant. 



47. The process of claim 46, wherein the trehalose is purified 
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by conventional methods. 

48. The process of claim 46, wherein the trehalose is used as 
such or as an unpurified homogenate. 

49. A method for selecting strains which have been transformed 
with at least one of the genes of claims 1-15, wherein the 
selection medium comprises galactose as the main carbon source. 
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Promoter 

TTGTTGCGAT TGTTCTGTTC CATCTGCACC AGAAC&AAGA ACAAAAGAAC -396 
AAGGAACAAA GTCCAAGCAC GTCAGCGCTG TTTATAAGGG GATTGACGAG -346 
GGATCGGGCC TAGAGTGCCA GCGCGCCAGG GAGAGGGAGC CCCCTGGGCC -296 
CTCATCCGCA GGCTGATAGG GGTCACCCCG_£TGGGCAGGT CAGGGCAGGG -246 
GCTCTCAGGG GGGCGCCATG GACAAACTGC ACTGAGGTTC TAAGACACAT -196 
GTATTATTGT GAGTATG TAT ATATAG AGAC AGATTAAGGC GTACAGCCGG -146 
GTGGTAGAGA TTGATTAACT TGGTAGTCTT ATCTTGTCAA TTGAGTTTCT -96 
GTCAGTTTCT TCTTGAACAA GCACGCAGCT AAGTAAGCAA CAAAGCAGGC -46 
TAACAAACTA GGTACTCACA TACAGACTTA TTAAGACATA GAACT ATG +3 

Terminator 

ACC AAA AAC TGA TGA ACCCGATGCA AATGAGACGA TCGTCTATTC +1521 
CTGGTCCGGT TTTCTCTGCC CTCTCTTCTA TTCACTTTTT TTATACTTTA +1571 
TATAAAATTA TATAAATGAC ATAACTGAAA CGCCACACGT CCTCTCCTAT +1621 
TCGTTAACGC CTGTCTGTAG CGCTGTTACT GAAGCTGCGC AAGTAGTTTT +1671 
TTCACCGTAT AGGCC +1686 

FIG. 3A 
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Promoter 

TGTAGTAACT TATTG CGAAA TTTC TCCTCT TCTCGTCTCG CTCAAAAATC -392 
GCGTTCAGGG TAAAAGGGGC CAAAPacar.r: GCCAGATAGA_AATTTCGAGA -342 
" "AraCGGGTCA~CCCCGCCCCT GCATTTTGAT ATGGCGTATT TGGGATTGCT ~292 
TGCTCGAAAG TGTCTAAGTC CGGCTGGCGG GCCTGGCGCC CTCGCCGAAG -242 
GGAGATAGGA AGGGGCGGGG GTCCGGGCAG CGGCTATGGT GTCAGTTACC -192 
TAGGGAAGGA GAAGGGGGTA GAACCMS£G_GCTAGCACAC TCACCCTGGG -142 
GCCCCCGTCT AGCCAAGCTT AAAI^TAAAT ACTAATGTAA CTJlTAAATAT -92 
AAGGATCTAC CGTGTCATTG CACATCCACC CACCCGTCGA TTAAAAAACC -42 
AAACAAAGCA AAGAATACAA TAGCAACGCA AGATCAACAC A ATG GCT +6 

Terminator 

TTC ACG ATC ATT TCA AGA ATC ATT GAA GAT TAA ATTTTACCAT +3307 
TTTAAAATTT TAATGTTCTT GGGTATGAAC TTTTATTTTC AACTGCTTAT +3357 
TATATATCAA TTCTATAAAT TTTTTTCTTC TCTCTAACGA CCAATTATAA +3407 
AATTCATCCT CTTATTTATT ACAGCATCTT ATACATTATG TATATGGGTA +3457 
GCTATTATTC ATTTTTGCTT CGTAAGGACT TTTTTTGTCA ACTTTrTCAT +3507 
CCTAAGCGGC TAAAAGTGAT TGGAGAGGAA TGtccaggcg accaatgata +3557 
aaaacgcttt ctcttggaac aagaaatagg agcaattgac agttgtcgat +3607 
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Promoter 

TGTAGTAACT TATTG CGAAA TTTC TGCTCT TCTCGTCTCG CTCAAAAATC -392 
GCGTTCAGGG TAAAAGGGGC GAAAPAflagr: GCCAGAT AGA AATTTC SAGA -342 
- AAACGGGTCA CCCCGCCCCT GCATTTTGAT ATGGCGTATT " TGGGATTGCT ~-292 
TGCTCGAAAG TGTCTAAGTC CGGCTGGCGG GCCTGGCGCC CTCGCCGAAG -242 
GGAGATAGGA AGGGGCGGGG GTCCGGGCAG CGGCTATGGT GTCAGTTACC -192 
TAGGGAAGGA GAAGGGGGTA GAAC CAAGGG G CTAGCAPAf TCACCCTGGG -142 
GCCCCCGTCT AGCCAAGCTT AA ATATAAA T ACTAATGTAA CTATAAA TAT -92 
AAGGATCTAC CGTGTCATTG CACATCCACC CACCCGTCGA TTAAAAAACC -42 
AAACAAAGCA AAGAATACAA TAGCAACGCA AGATCAACAC A ATG GCT +6 

Terminator 

TTC ACG ATC ATT TCA AGA ATC ATT GAA GAT TAA ATTTTACCAT +3 307 
TTTAAAATTT TAATGTTCTT GGGTATGAAC TTTTATTTTC AACTGCTTAT +3357 
2A2A2&TCAA TTCTATAAAT TTTTTTCTTC TCTCTAACGA CCAATTATAA +3407 
AATTCATCCT CTTATTTATT ACAGCATCTT ATACATTATG TATATGGGTA +3457 
GCTATTATTC ATTTTTGCTT CGTAAGGACT TTTTTTGTCA ACTTTTTCAT +3507 
CCTAAGCGGC TAAAAGTGAT TGGAGAGGAA TGtccaggcg accaatgata +3557 
aaaacgcttt ctcttggaac aagaaatagg agcaattgac agttgtcgat +3607 
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